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Preface 



This proceedings volume contains 6 invited state of the art lectures and 50 contributed papers 
presented at OR 98 held at the Swiss Federal Institute of Technology (ETH) Zurich, 31 August - 3 
September 1998. This 4-annual international conference was jointly organized by the Swiss, German 
and Austrian OR-societies and co-sponsored by SIGOPT. The conference had 450 particpants from 
36 countries around the world. 

The proceedings of this conference are subdivided according to the eleven sections of the conference, 
namely 

- Continuous Mathematical Optimization (8 papers), 

- Discrete Mathematical Optimization (12 papers), 

- Stochastic Modelling, Optimization k Simulation (4 papers), 

- Econometrics k Statistics (3 papers), 

- Mathematical Economics, Game Theory k Decision Theory (3 papers), 

- Banking k Finance (2 papers), 

- Operations k Production Management (3 papers), 

- Energy k Ecology (1 paper), 

- Telecommunication (4 papers), 

- Logistics k Transportation (10 papers), 

- Fuzzy Systems k Neural Networks (3 papers), 

- GOR Diploma Prize (1 paper). 

The number of papers included in this volume reflects well the strength of the corresponding sections 
at the conference. Thus, we observe that the intellectual mainstream in Operations Research still is 
mathematical optimization and its application in logistics and transportation systems. In addition 
to these traditional areas the conference emphasized on potential contributions of OR in the new, 
challenging area of system design and management in the liberalized energy and telecommunication 
market. In this new domain of applications we can learn from finance industry how to design and 
price new products and how to manage risk associated with a new product portfolio. These sectors 
(including banking and finance) will definitively become an attractive area for operations researchers 
both from a theoretical and practical point of view. 

We have taken the efforts to insure scientific originality and quality of the included papers by a 
refereeing process. We express our gratitude to all referees, and we thank all authors for delivering 
the final version of their papers in due time. Finally we appreciate the smooth cooperation with 
Springer- Verlag. 
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Chair: H.-J. Luthi 




Challenges for Operations Research as a Scientific 
Discipline at the Beginning of the 21st Century 



Prof. Dr. Dr. F. J. Radermacher 



Introduction 

The paper gives in Chapter 1 a view on global challenges when approaching the next century. We 
will develop into a world with 10 billion people or more, a globalized economy, and an environment 
close to collapse. The importance of technical innovations and suitable world- wide economic regimes 
is worked out. Against this background, in Chapter 2 challenges to science in general are discussed. 
Important topics concern a world- wide and more ecological accounting, global education networks, 
progress in medical care and social infrastructure and contributing to good governance. In Chapter 
3, we give a few hints to the future role of Operations Research. We also point at important 
contributions of Operations Research and related disciplines to topics such as optimization, decision 
making, planning, scheduling, and logistic. In Chapter 4, we finally give some hints to the general 
topic of knowledge organisation, which is gaining more and more importance. This includes a better 
understanding of the internet as a framework for distributing knowledge as well as understanding 
businesses, universities, and even states and humankind as complex organisms and learning systems. 
In this context, different levels of information processing, be it neural-network type mechanisms, rule 
processing, or mathematical models and their interplay, have to be addressed accordingly. 

1. Global challenges and what to do about them 

Since the Rio World Conference on Environment and Development in 1992, the concept of sustain- 
ability has been at the centre of the international political debate. In the language of the Brundtland 
Commission, sustainability means “that current generations should meet their needs without com- 
promising the ability of future generations to meet their own needs”. In this concept, economic, 
social and environmental aspects are closely tied to each other. The principal question is how to 
fulfil human rights and allow development in a world moving towards 10 billion people and one global 
market, while at the same time keeping the environment intact. A good way to make the challenge 
obvious is the so-called MASTER EQUATION: 

B = P X L X T 



(total environmental burden B = population size (P) x average standard of living (L) x technological 
efficiency parameter (T); here T translates a unit of standard of living into resource use and induced 
environmental burdens.) 

The topic of sustainability is most closely tied to the development of a world- wide information 
and knowledge society [36]. On one hand, information and communication technology is the major 
enabler and driver of globalisation. This adds substantially to environmental burdens and also puts 
the social systems and labour systems of the developed countries under enormous stress. On the 
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other hand, information and communication technologies offer completely new solutions for many 
of our problems, from global awareness raising to world- wide education systems and an eventual 
dematerialisation of most of our economic activities. 

The topics of the interplay between economic globalization, sustainability and the informa- 
tion society are deeply discussed in documents of the Information Society Forum of the European 
Commission and the Forum Info 2000 of the German Government [7, 12, 13], for related information, 
see also the recent OECD document [24]. Some major positions from [12] are listed below in the 
form of 10 theses. Major issues discussed concern the concept of dematerialisation [41, 44], what 
it is about and why it offers great hopes. At the same time, the so-called rebound effect [36] is also 
addressed. This effect causes maybe the greatest challenge ahead, because — historically speaking 

— it always seems to eat up all the positive potential for better protection of the environment that 
technology seems to offer by (1) allowing ever more people to live on the earth who, (2) on the 
average, always generate more activities per person. 

The Information Society, today, is at the centre of these processes, driving dematerialisation as well 
as rebound effects. There is available by now a number of hints for possible strategies to maximise the 
sustainability impact of Information Society developments, but it also becomes clear, that a central 
key to overcoming poverty and achieving human rights for all and to sustainability is the 
evolution of global economic and social regimes to better meet urgent world-wide ecological 
and social requirements. Since by now there are no world market prices reflecting adequately the 
scarcity of ecological and primary resources, there is a need for frameworks and institutions enforcing 
suitable price regimes in order to bring exhaustion rates into balance with sustainability. This might 
be implemented e.g. by mechanisms such as world-wide tradable C02 emission certificates, 
where the Kyoto Summit just led to a first step in the right direction. In addition to that, there is 
the need for world-wide social systems, e.g. education systems, which we all have to invest in, as 
raising human resources world-wide is one of the most important instruments for improvement 
available. In order to establish such regimes, it seems natural to integrate these aims into existing 
regimes, e.g. by developing further GATT/WTO, UNESCO, World Bank, and the like. For some 
quantitative insight into such topics and needed developments, cf. [2]. 

Another question is how at the present moment Europe should deal with these issues politically. 
This concerns a double strategy of staying competitive in the world market even if, for the time 
being, this requires actions that are contrary to many of our long-term aims concerning social and 
environmental politics while at the same time fighting for a better world-wide orientation in the 
longer run. That such efforts can be successful, and that Europeans do sometimes have a major 
impact, was just demonstrated in the recent World Climate Summit in Kyoto and its outcome 

- even if it was only a first step. With respect to a long-term perspective, we need to develop - as 
input into the political debate - what might be called a European Vision of global governance, a 
new social contract, sustainability and a better world. 

In the following, 10 central guidelines for the challenges ahead of us are listed, with reference to work 
of the Information Society Forum in Brussels and the German Forum Info 2000 [7, 12, 13]. 
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10 Theses 

1. Securing sustainable development is the decisive challenge for the next century. 

2. Economic globalization brings with it great potential, but also risks for sustainable develop- 
ment. 

3. The question as to whether sustainable development can be attained will be decided during 
the transition to a world-wide Information and Knowledge Society, and will depend on what 
shape that society actually takes. 

4. Economic globalization, the ecological problems to be expected and the current development 
of world population will over the medium term only allow for those societal configurations and 
lifestyles which can be expanded to carry 10 billion people. 

5. Technological progress provides valuable opportunities to find solutions to the challenges we are 
facing. Most important for sustainable development are forms of technical progress which lead 
to an increase in resource productivity (dematerialization). Information and communication 
technology provides an essential contribution in this direction. 

6. Technological progress and dematerialization will not by themselves solve future problems; no 
more so than will isolated changes in the areas of behaviour or social structure. Counterpro- 
ductive feedback effects are extremely important to remember here. Such rebound effects work 
like a boomerang. 

7. The rebound effect in a globalized economy can only be limited if appropriate social and 
ecological ’guard-rails’ are installed world-wide by politics as part of the world’s economic 
system. Such guard-rails have to be implemented locally in the form of national and regional 
legal framework conditions (the subsidiarity principle). 

8. A central challenge for the future is developing a more global partnership in the form of re- 
sponsibility shared in accordance with each partner’s ability to contribute (global governance), 
i.e. joint world- wide acceptance of responsibility for sustainable development. 

9. On the path to a United Europe, we should develop and defend our views on such topics as 
the world trade order, justice, culture, social standards and environmental protection as the 
European Model on the world’s political stage with more determination. 

10. We should regard with an open mind the unavoidable consequences of adapting to the world 
marketplace that are resulting from globalization and the mounting economic power of regions 
that were previously lagging behind. There will indeed be difficulties on the way before us, but 
there will be great potential as well. 
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2. Some possible contributions of science on the way ahead 

On the way to the next century and given the problems described, science is one of the strong forces 
world wide to put hope into. For a number of reasons, science may be able to contribute to solving 
the problems we are facing. Science is a societal sub-system with a great deal of independence. In 
science, independent thinking is still allowed and asked for. Science has an orientation towards the 
truth, and scientists try hard to address topics as unbiased as possible. Still, science is by far not 
really globalized. In particular, most scientific research is still undertaken in the richer countries of 
the world and, to a great deal, driven via market forces. Major efforts are consequently directed 
and invested into special problems of the most developed parts of the world, while often missing 
the broader issues. So there is much scientific research devoted to the health needs, consumer 
requirements, sporty activities and so forth of a rich and relatively old population in the North in 
spite of the severe and quite different needs of young and poor generations all around the globe. Still, 
if there is a hope at all for change, science has a major role in it and certainly, what science is doing 
will change the landscape we will be living in for the next decades to come. 

In the following, we discuss some topics, that seem to be of particular importance. A major issue 
concerns the question of how we get our economic accounting systems into a form where social 
and ecological considerations are better taken care of then today. Today what destroys social and 
ecological capital often adds to the Gross National Product (GNP) which, therefore, often gives us 
a wrong indication of where we stand and should go. We need better indicators and better 
accounting systems, we need better frameworks and somehow have to couple the forces of the 
market with better orientation lines to serve the interest of the people and the ecological system 
better. 

In this context we also need more investments into global medical care systems and social infras- 
tructure, into world- wide pension systems and into global education networks [34]. Concerning 
this issue we refer, as one example, to the ’Health for all’ scientific framework building process of 
the World Health Organisation (WHO) [3, 10, 45, 46], where FAW was strongly involved concerning 
the aspects of framework building, planning methods, tools, and usage of global information net- 
works; see Fig. 1 for a visualisation aid developed [3]. Investments into global social infrastructure 
aims to improve the standard of living for many people in the world and, at the same time, hope- 
fully to induce a different generative behaviour. Changing this behaviour might eventually and 
hopefully bring us back on a course where the world population decreases instead of always 
growing [2, 6, 36]. 

What we then also need is a science for good governance and a better way of making use of, 
distributing and handling knowledge. In this respect one statement of the industrial declaration 
of the minister conference in Bonn in July 1997 [11] gives an important direction of where to go: 
“The global players - private and public enterprises together with governments and international 
organisations like the EU and the G7 should constitute a “Global Information Superhighschool” 
(a multimedia networked (high) school) for global wide sustainability, as a new concept for 21st 
century education. An adequate organization form for the Information Superhighschool, as far as 
global distribution, delivery, and co-operation with local educational institutes are concerned, would 
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have to be studied upon.” 

We finish with listing 3 more important scientific challenges for the next decades, which all offer inter- 
esting opportunities for using (non-classic) mathematical models and new algorithmic tools. These 
are the Human Genome project, the further development of multimedia and computer 
networks, which may develop into something like a nervous system for humankind and into the ma- 
jor platform for all kinds of activities in society, business, and learning, and, finally, the development 
of service robots for all kinds of every day life tasks. 

The research into service robotics, where FAW is strongly involved [8, 16, 28, 29, 42], also offers new 
opportunities for better understanding the functioning of intelligence, cognition and consciousness. 
All of these developments have the potential to dramatically change the frameworks and conditions 
under which we will live and operate in the future. 

3. Some comments on the future of Operations Research 

Operations Research has existed as a vital academic discipline now for more than 50 years while at 
the same time extending more and more its range of applications [33]. In a broad sense, it stands 
for decision making and optimization, and it has by now found its way into almost all kinds of 
application areas such as industries, administrations, and the traffic, health, and military sector. 
Typical features of Operations Research are: 

• a broad range of models considered 

• a broad range of objectives taken care of 

• elaborated algorithmic approaches and systems support. 

From the very beginning Operations Research — although finding positive response — also had to 
cope with considerable problems. Generally, implementing scientific results into practice is not easy, 
and this is even more true in an area which deals with real decision making [30], e.g. which is related to 
the human factor, power struggles, etc. In a sense, the contributions of Operations Research address 
a field in which humans do not always work openly to reach officially set targets, and in which hidden 
agendas, non-transparency and personal interests often play a decisive role. On top of that, 
data requirements are often enormous and counter-productive. Concerning the last point mentioned, 
the situation has by now improved considerably. Here, Operations Research has benefited enormously 
from a close co-evolution with the development of computers and networks, parallel to new and broad 
disciplines, such as computer science. As a consequence, today much of the data needed is routinely 
available and automatically processed, there are common data interchange facilities along the added- 
value chain, and there are much more powerful computing resources available, even at single work 
places. Through communications infrastructures, data and results can be shared within companies 
and world- wide. This by now extends far beyond data exchange into the direction of sharing of 
algorithms, models and knowledge, partly using the world-wide web as a general infrastructure for 
access to models, algorithms, and computing resources [14, 20, 21, 22]. 
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(a): The developed countries 




(b): The developing countries 

Figure 1: Visualisation of the health status of the developed and the developing countries by means 
of the “Visual Health Information Profile (VHIP)” 
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The progress in recent years has brought with it many changes for Operations Research and many 
other academic fields. This development has also led to a rapid growth of some new and/or spe- 
cialised disciplines dealing with related topics, sometimes also covering parts of the earlier grounds 
of Operations Research. Concerned disciplines are: 

• cybernetics / systems theory 

• applied / numerical mathematics, combinatorial optimization, industrial mathematics, scien- 
tific computing 

• computer science, applied computer science (with emphasis on algorithms, data structures, 
complexity theory) 

• decision theory (decision analysis, decision support systems, game theory) 

• computer science in economics, in the traffic and health sectors, in administration 

• classification and terminology as scientific fields 

• artificial intelligence, neural networks, and fuzzy systems 

• engineering, simulation. 

In the last 50 years. Operations Research has proved to be surprisingly vital with respect to all kinds 
of developments. This ability to stand up in a difl&cult environment results from the broad scope of 
ideas and tools that Operations Researchers bring into the game, along with their willingness to 
be close to users. This means being able to understand the practical problems of such users, and 
to bring into this process a broad theoretical insight into the principal nature of many problems 
in a decision- or game-theoretic context or in optimization, the knowledge about what models, what 
objectives, what tools, what type of computer support should be used and in which way to really help 
them. The willingness of Operations Researchers to concentrate on sophisticated and non-standard 
modelling aspects and to deal with all kinds of models and objectives, should be particularly noted 
here. 

There is every chance and indication that this picture will prevail also in the future as long as 
Operations Researchers remain willing to build up the knowledge required, use a broad range of 
models, objectives, and tools, and really try to understand the needs of the users in those applied 
tasks in which sophisticated modelling and computer support is needed to give adequate answers. 
These are fields in which the particular problems are too sophisticated in nature to be dealt with by 
standard computer-literate application-oriented people using tools from libraries on their own. There 
is every chance that with the transition to the information society and building up of the information 
super-highway, the role of Operations Research will grow further, as Operations Research can enrich 
the content side by methods that help in the generation of new knowledge, for example, in the context 
of so-called filters and brokers. More information on this field is given in Chapter 4. 

In the following we briefly mention a number of application areas where Operation Researchers should 
continue to find interesting areas of participation. 
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1. Optimization will gain further in importance 

We see an increasingly higher importance of optimization tasks of all kind. This is due to the 
fact that problem sizes become bigger all the time. For instance, new technology means 
that we generate more data and deal with greater communication networks. Globalization 
means that we always have to cover greater geographical areas. Miniaturization means that 
there is more and more functionality to be integrated on one chip. And in a world full of 
needs and poverty there is no space for wasting resources or opportunities. Consequently, more 
optimization needs and continually larger problems to deal with are a typical picture as is the 
need to always do a better approximation where problems are too hard to be typically solved 
to optimality. For a recent application in the field of making information on public transport 
available, see [39]; for overviews concerning the field of optimization, see [1, 23]. 

2. Planning, scheduling, logistics will be employed in more fields 

These topics may be seen as a special case of 1, but are of particular importance for all kinds 
of business activities and consequently start to be addressed within huge business software 
packages. We just mentioned that the use of the global positioning system (GPS) as a 
means to bring on-line information concerning the place and time of interested objects, such 
as cars, into systems for all kinds of processes, will change the picture dramatically. This is 
true also for the usage of all kinds of other sensors and will in the future also be true for still 
more actuators. For instance: imagine hundreds of millions of cameras being on-line connected 
to the Internet. Consequently, there is more and more eed for good planning, scheduling 
and logistics and there is always better data and a better computation and communication 
framework available to deal with such issues. For some general references in the field, see [4, 
9, 26]. 

3. Formal decision making, preferences, game theoretic aspects come strongly into 
the picture 

Decision theory and game theory are important areas of work for Operations Research [17, 
18, 19]. Still, real-life applications are less frequent than should be expected. There are many 
reasons for this, that deal with power aspects, hidden agendas, data deficiencies and so forth. It 
is interesting to see, though, that often electronic agents start to behave like ‘subjects’, possibly 
making deals for us, making proposals for us as in personal travel assistants (PTA) or being 
the basis of characters in games, movies and so forth, or part of the control system of 
future service robots [15, 16]. This means that within such systems, we in the future have 
to deal more with (subjective) preferences, be it our own preferences or the preferences of 
machines. We, therefore, have to organise decisions in those agents and we have to take game- 
theoretic constellations, like prisoners dilemma, into account. So the route that society goes 
means we get more situations where the type of sophisticated knowledge available in fields 
such as decision theory or game theory, will be needed broadly and can be involved without 
the practical obstacles, typical at present. 

4. Education, investing in human resources and capacity building are crucial issues 
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For Operations Research as a discipline it will become important to make our own knowledge 
available over networks in courses for education, be it primary education or be it education 
directly at the point of learning in companies or at home. This will be an important con- 
tribution to world-wide needs and also an important contribution to making the skills that we 
stand for more broadly available [34]. By doing so, we will also strengthen our own community. 

5. Neural networks, learning systems and genetic algorithms open up a new paradigm 

It has become apparent over the last few years, that our skills have to also be used in more 
unstructured domains. This concerns areas such as data mining of huge data collections or 
arguing about the consumer behaviour of consumers based on knowing the patterns of their 
credit card use. In these contexts, the automatic updating and learning of systems 
structure based on ongoing information is necessary. Mathematically, the task falls into 
the field of non-linear optimization and approximation theory, where genetic algorithms and 
other tools, such as neural networks [25], offer new potential. There is a broad area of 
topics to be addressed. In some cases data will have to be complemented with other data 
sources. These might for instance be of a geographic nature. In this context, GEOGRAPHIC 
INFORMATION SYSTEMS (GIS) have often turned out to offer powerful means to do 
knowledge integration over different knowledge sources [40, 43]. 

6. Working with multimedia content 

In a multi-cultural, multi-lingual world with huge differences in education and development, 
multi-media products of all kinds, e.g. advertisement, sport events, soap operas and so forth, 
are gaining increasing importance as a general means of communication to all people. A major 
problem is, however, how to process, find, index, maintain and retrieve the respective 
content. Here optimization and other techniques are needed for encoding and for dealing with 
such content, particularly content from sensor data that is not symbolical in nature. This 
particularly concerns images, video films and so forth. One major issue concerns the question 
of how to automatically get short abstracts of content. Among other aspects, this may lead to 
a modern way of documenting meetings. Making anonymous parts of the information in such 
a framework (in particular de-personalisation) might turn out to be another important topic. 

4. Knowledge processing as an integrated and holistic task 

One of the biggest challenges ahead is more effectively dealing with knowledge. Knowledge is 
gaining more and more importance as the centre of value added processes. Still, we often get stuck 
with discussions about where the difference is between data, information and knowledge. Particularly 
in a classical philosophical tradition, knowledge is exclusively related to information that is explicitly 
processed by human brains and finally understood there. However, we observe today, that more 
and more often, machine centred contributions that turn out to be part of value added processes 
that until recently used to be knowledge (i.e. that had to go through human brains) are becoming 
practically relevant. 

In a very general sense, therefore, it seems to be important to separate the knowledge concept from the 
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necessity that a human brain is involved. If we do that we mentally disclose new knowledge resources. 
This concerns at least two areas. The first is the area where humans have all the knowledge, but 
cannot talk about it, because the knowledge is in the form of a senso-motoric function. The second 
area concerns new ’knowledge’ intensive applications, in which humans are eventually not involved 
any more. We mention here as examples for the first type of knowledge the ability of a particular 
human to drive a car, design a house, manage all kinds of emergencies full of unexpected occurrences. 
We mention similar phenomena in companies, where self-organization is often the most important 
ingredient to success but still this is not adequately taken into account on the planning level of the 
company, because it is so difficult to talk about. Concerning the second aspect, we would better 
understand where knowledge sits in machines, be it in robots or be it in the world wide web. All 
this becomes important in value added processes. And more often than not, humans are by-passed 
this way, be it for good or for bad. 

To take such a very general point of view is a major ingredient of FAW’s work in the field of knowledge 
processing over the last decade. As a basis for this work, FAW has developed a four-level architecture 
of knowledge processing to better deal which the processing of knowledge [31, 32, 35, 38]. This four- 
level architecture is now also the basis of the DFG Sonderforschungsbereich No. 527 titled 
“Integration of symbolic and subsymbolic information processing in adaptive sensorimotor systems” , 
which is a joint effort of the University of Ulm, the Daimler Benz Research Centre in Ulm and FAW, 
and which has been in operation since the beginning of 1997. The main application area addressed is 
service robots; on the content side, there are connections to the topic of (artificial) consciousness 
and creativity [27, 31, 32, 38]. Another important topic is the interplay between the subsymbolic 
(neural) and the symbolic part of knowledge processing and connections to brain research [5, 25, 
31]. Also, the FAW knowledge management method for organisations [35] is based on this approach. 
It is now the basis for consultancy work with major companies, understanding them as complex 
organisms and sensomotoric systems. Particularly, if we consider companies in the transportation 
area or in the construction field it is interesting to study where the sensoric part and where the 
actoric part are to be found. Finally, one can also take a general look at human kind as an intelligent 
body, as done in Chapter 1 of this paper. 



4.1 Knowledge: what is it and how is it represented? 

In the following section, an attempt will be made to classify what role is played by knowledge, 
not only in the context of systems generally, e.g. biological systems, but just as much in complex 
machines or in organizations or large structures where people and machines work together, so-called 
superorganisms. In doing so, a four-level architecture of knowledge processing developed 
at FAW will constitute a basis which, in principle, enables the inclusion of vastly different forms, 
systems and areas of application of this kind and which seems, at the same time, to be capable 
of functioning as a store of all relevant forms of knowledge. This four-level architecture is in this 
sense the basis for many concrete applications and allows new approaches to this whole domain. We 
distinguish in this perspective four different levels of knowledge (cf. also Fig. 2) . 
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1. Signal level 

At the lowest level we observe a direct interaction of a physical nature between systems and the 
surrounding world. Signals, in particular are part of this. Signals are picked up or identified 
by sensor systems, inducing, on the one hand, direct effects and, on the other, serving as a 
source of features, which are an initial form of information-compression. Certain sensors 
are directed (primarily) to the world outside the system, others primarily to the system’s 
inner life. 

2. Feature level 

Features are the input-information of the second level of information processing under con- 
sideration. Here, on the one hand, begins a functional transformation of such features, for 
example, into motor response or actuation, perhaps in the form of trained (artificial) neural 
networks. On the other hand, objects or concepts are identified by classifiers on the basis 
of features. 

3. Symbol level 

The symbol level is concerned with objects or concepts and deals with them by means of 
various mechanisms of symbol processing (the processing of rules, logic, relational systems, 
etc.). This is the classic domain of Artificial Intelligence (AI) as a scientific discipline. 

4. Theory level 

From the third level one can, in certain domains, as necessary, go on to the fourth level of 
theories and models and can then, for example with the aid of mathematical methods from 
the fields of optimization, statistics, decision theory and numerics, make assertions and come 
to conclusions, on the model level, which can be re-translated into concrete behavior via the 
other levels involved. 
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To illustrate the relationship of effects on these various levels, corresponding steps in human infor- 
mation processing can be referred to: if, for example, someone is pricked with a pin, the skin first 
moves inwards and the arm backwards - this is the level of direct physical reaction (level 1). But, 
at the same time, the person concerned (that is, his brain or his nervous system) can also filter 
from the accompanying signal stream information or features which indirectly convey the message 
that something is obviously pricking him. These recognized features flow (on the second level under 
consideration) into a biological neural network, which induces a motor response. The arm, for exam- 
ple, is withdrawn in a reflex movement. This happens without the person concerned being at that 
moment aware that something has pricked him. However, on the next higher level of abstraction 
(third level), the person can also recognize the situation, that is, can conceptionally classify it as a 
situation in which someone has pricked him. On this level, a person can, if necessary, analyze the 
situation and draw from this an appropriate form of behavior. Finally, a person who is a scientist 
can, on a further higher level (the fourth level) concern himself with a theory which describes exactly 
what happens when a pin pricks his skin and how the stream of signals travels through his body 
in terms of time or intensity. To this end, the signal streams are possibly described by systems of 
differential equations and, finally, solutions for these systems are examined. 

With a view to this general approach, it is then of interest in all kinds of applications to establish 
what tasks can be tackled, with what mechanisms and on what level. 

In terms of intellectual capacity, the total range of achievement on all the four levels just described 
exists in brains or nervous systems of human being/mammals, incidentally, as a whole unified neural 
architecture, the possibilities of which - exaggeratedly expressed - can also be misused to do things 
for which they were not intended originally, such as symbolic information processing. 

The basic capability of a biological neuro-system consists of the ability to establish relationships 
between certain phenomena in the form of learning from example. In content, this corre- 
sponds to questions of function approximation in mathematics. There are meanwhile mathematical 
theorems which precisely describe this basic potential of neural networks. It is worth mentioning here 
that the existing pointers to biological systems can be carried over to (biological) superorganisms, 
e.g. insect colonies, swarms, states and also to humankind as a system. 

4.2 Concerning the incorporation of the four-level architecture in other fields of appli- 
cation or other areas 

In a similar way as with biological systems, including superorganisms, one can also study other, 
non-biological systems based on this theory and this is particularly interesting in the context of this 
text. So, for example, one can study service robots, human-machine systems and businesses from 
this perspective. 

As an example of an application to a business, think about elements of knowledge which work 
together on the various levels of interaction in the context of design processes as part of the 
work done in certain enterprises. Level 1, the lowest level, concerns, for example, the “nature” and 
“atmosphere” of buildings, door heights, room sizes, fittings with an infrastructure, etc. Level 2 
concerns mechanisms, indirectly derived from and not directly gained from arguments of cause and 
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effect, for the organization and improvement of communication, for example, the division of units 
and teams (into work groups, projects, departments, etc.), the installing of copiers, the positioning 
of central stairways, a generously-sized cafeteria, the placing of plants and works of art, etc. Also, 
essential for this level is dealing as a whole with questions of form, color or design. Knowledge 
level 3 concerns the recording of regulations and calculations or organization processes and sets 
of rules which specify how persons in a design team have to perform their jobs in co-operation 
with others. Here are also gathered together the (explicit) legal requirements, auditing procedures, 
standards, ISO rules etc. Finally, on the fourth level of mathematical models, models are grouped 
together showing how certain decision-making processes are to be carried out as part of departmental 
interaction or how certain designs are to be tested for functionality. 

It should be mentioned that the main emphasis and the most important effect factors on each level 
are very heavily dependent on the size of the firm or organization in question that one is trying 
to describe. Investigations, for example, have shown that, for the single person, one of the most 
important starting points for making changes in processes is motivation, something which is very 
heavily dependent on levels 2 and 3 of the hierarchy referred to. For groups, perhaps the most 
important factor is the form of organization in each case, i.e. the organization of the mechanisms 
of interaction among those participating. This aspect is in substance very heavily concentrated on 
levels 1 and 2. For larger associations or enterprises, the availability of information seems to be 
a very central success factor. These are mechanisms which are concentrated for the most part on 
levels 2 and 3, but in character are very different from motivation for the individual. 

It should, finally, be mentioned that even in academic disciplines, something like the different levels 
of knowledge considered can be found. E.g. in Operations Research or optimization, methods such 
as simulated annealing, greedy methods, and local search procedures would belong to level 2, a 
deep classification of different cases within a broader field with respective methods and procedures, 
as available in scheduling or logistics, belong to level 3, and theories about optimality of certain 
methods in certain cases or convergence of procedures under certain constraints belong to level 4. 

Summary: Whether we deal with robots, enterprises or societies, information processing is always a 
difficult and crucial issue, having to do with humans and technical components, codes and pragmatic 
issues, the issue of symbol grounding, performance and so forth. In all cases, it seems important 
to look for sensors, actors and intermediate forms of information processing. Knowledge may (1) 
use the ‘key-lock’ principle, (2) consist in the approximation of certain functions via forms of self 
organisations, e.g. in neural networks, (3) maybe available in the form of a von-Neumann machine like 
processing of rules, constraints and so forth or (4) could mean the development or use of mathematical 
models. In this context, the global information super highway may be seen as a nervous system of 
human kind, generally offering more feedback between humans as an element of self organisation. 
Search Engines and corresponding filters and brokers are knowledge organizing devices, partly rule- 
based and symbolic in nature (level 3), but partly driven by human interests in topics, words, 
key-phrases and so forth, that are more positioned on level 2 of the knowledge hierarchy discussed. 
Certainly, phenomena such as information overload require better approaches towards the organi- 
sation of content. This is, by the way, also true for scientific disciplines if they mature. A good 
organization of content must be based on general world models, including controlled vocabulary, 
thesauri and so forth, based on level 3 and 4 concepts. Some hints to such questions, for the field of 
environmental information systems, can be found in [40, 43]. More information on priorization and 
arbitration is given in [15], more information on relations to topics such as creativity and innovations 
by humans and in organisations is provided in [32, 37]. 
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There are many challenging things to do. The Operations Research background in its full scope has 
a lot of input to offer, when moving into the next century in a world full of prominent and unsolved 
problems. 
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Summary: In “classical” optimization, all data of a problem instance are considered given. The 

standard theory and the usual algorithmic techniques apply to such cases only. Online optimization 
is different. Many decisions have to be made before all data are available. In addition, decisions once 
made cannot be changed. How should one act “best” in such an environment? 

In this paper we survey online problems coming up in combinatorial optimization. We first outline 
theoretical concepts, such as competitiveness against various adversaries, to analyze online problems 
and algorithms. The focus, however, lies on real-world applications. We report, in particular, on 
theoretical investigations and our practical experience with problems arising in transportation and 
the automatic handling of material. 

1. Introduction 

It is a standard assumption in mathematics, computer science, and operations research that problem 
data are given. However, many aspects of life are online. Decisions have to be made without knowing 
future events relevant for the current choice. Online problems, such as vehicle routing and control, 
management of call centers, paging and caching in computer systems, foreign exchange and stock 
trading, have been around for a long time, but no theoretical framework existed for the analysis 
of online problems and algorithms. Correspondingly, only minor research efforts were spent on this 
topic. 

The situation has changed considerably in the recent ten years. Both, the introduction of competitive 
analysis and the increasing economic importance of online problems have fostered this development. 
The first books on online computation, see [13, 15], and several survey articles, see [1, 2, 8, 22, 27] 
appeared recently and made the achievements and the weaknesses of the current theory known to a 
broader public. 

This paper is not another comprehensive survey. It only briefly outlines some parts of the relevant 
theory. It concentrates on concrete applications of combinatorial online optimization, sketches the 
(rather poor) theoretical knowledge on these problems and describes what is done in practice for 
their solution. We report in particular on the control of elevators, the routing of stacker cranes, and 
the assignment of orders to commissioning vehicles. 

2. Theory and Reality 

We imagine to be in the following situation. We wish to optimize a certain objective function (in 
this paper we will always minimize) and have to make decisions before we know all data. We call 
such a scenario an online problem. We describe such a problem by a sequence of data (in online 
optimization one usually speaks of a sequence of requests) and assume that, after a request arrives. 
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a decision has to be made before the next request appears. An algorithm acting this way is called 
online algorithm. It is, in general, also assumed that a decision cannot be revoked and that the 
number of requests coming up is unknown in the beginning. (Variants of these concepts will come 
up later.) 

Let Cx{(t) denote the value of the objective function, called cost from now on, that algorithm 
X produces when acting on the request sequence cr. (For notational convenience we will assume 
throughout this paper that all values Cx{a) are strictly positive.) If we have two algorithms A and 
ADV, both acting on cr, we can compare their results. We say that algorithm A is c-competitive with 
respect to algorithm ADV if there are real numbers c > 1 and b such that 

C'a(ct) < C • CADv(or) + b 

holds for all sequences of feasible requests a. The typical situation here is that we would like to 
analyze the quality of an online algorithm A with respect to another algorithm, called adversary and 
therefore denoted by ADV. The adversary is not necessarily an online algorithm and may know the 
request sequence in advance. Even worse, ADV may produce the request sequence in such a way that 
ADV’s solution is very good and A’s very bad. 

A worst possible adversary is an algorithm that computes an optimum solution. Making this precise 
needs some care. It is sometimes the case that a “natural” offline problem is associated with an online 
problem, i.e., that, ignoring the arrival times of the requests, the request sequence defines a standard 
optimization problem, the so-called offline version of the online problem. If OPT is an algorithm 
that solves this offline problem to optimality then an online algorithm A is said to be c-competitive 
if it is c-competitive with respect to its optimal offline adversary OPT. 

However, it is not always clear what the offline version of an online problem is since the k-th request 
in a sequence may depend on what the online algorithm has done on a previous request. We will 
discuss this matter further in Section 6. 

So far we have (implicitly) assumed that our algorithms act deterministically. In the context of 
online problems and algorithms it is natural to judge the quality of an algorithm by its expected 
performance and, moreover, to use randomized online algorithms. The concept of competitiveness 
can be defined for such settings analogously. Here it is also common to distinguish various adversaries 
with which the expected cost of a deterministic or randomized algorithm is compared. 

An oblivious adversary is an adversary that acts independently of the decisions of the online algo- 
rithm while an adaptive adversary generates the next request depending on the answer of the online 
algorithm to the previous requests. The adaptive adversary may compute, in the end, an optimum 
offline solution - in this case it is called the adaptive offline adversary - but this adversary may also 
be required to act online and make a decision whenever a new request has been generated. This 
adversary is called adaptive online. The costs incurred by the oblivious and the adaptive offline ad- 
versary are of course the (expected) offline costs while, for the adaptive online adversary, there is no 
precise characterization of the cost generated. One simply takes what the algorithm has produced. 
One can prove that the competitiveness of an online algorithm with respect to an oblivious adversary 
is not larger than that against an adaptive online adversary and that this is not larger than that 
against an adaptive offline adversary. 
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We remark here that we have not made any requirement on the running time of either an online 
algorithm or an offline adversary. In fact, in many cases the offline versions of online problems are 
jVT^-hard, and thus, we may compare a fast online heuristic with an algorithm that has exponential 
or worse running time. Competitive analysis ignores this aspect. 

In practice, however, online algorithms are usually required to produce solutions “quickly”. What 
“quickly” means depends on the application considered. The usual requirement is that an answer 
must be provided within a fixed time bound (which may be in the range of microseconds or even 
minutes if the process is somewhat slow). Whatever the bound is, the task is to find a “best possible” 
solution within the given time frame (and the available computing machinery). We call algorithms 
that are online and obey such additional requirements real-time. No (sufflciently interesting) concept 
for the performance evaluation of a real-time algorithm is known. 

We will demonstrate that competitive analysis provides an overly pessimistic picture, i.e., an algo- 
rithm may not be competitive at all or may have a poor competitiveness factor c while its actual 
performance appears much better. Such an observation cries for average case analysis. However, this 
is a rather unexplored area of research. Stochastic analysis of an online problem is the next natural 
idea. The typical approach here is to view the given online problem as a stochastic program where 
the requests occur according to some distribution. An outcome of stochastic analysis would be an 
estimate of the average behavior of an algorithm that is based on solutions of the stochastic program 
or, even better, an estimate of the expected online cost of a given problem. A critical discussion of 
some aspects of this approach can, e.g., be found in Appendix B of [13]. 

Queuing theory addresses online problems as well. However, the focus of this approach is different. 
One considers, e.g., a given system (including the algorithms that control the system) and assumes 
that a sequence of requests arrives according to some distribution. Queuing theory tries to estimate 
important parameters of the system such as average service or waiting times. The goal is the 
description of the state and not the optimization of the process, see [3] among others. 

If the theory of online algorithm looks so bleak and gives no guidance for practice what should be 
done in case of a real application? The answer of practitioners is: simulation. In fact, we have no 
better answer either and are actually using simulation to evaluate and compare online algorithms 
for combinatorial optimization problems that occur in practice. A discrete event oriented simulation 
program, called AMSEL [4], has been developed by N. Ascheuer that is particularly suited for a 
quick simulation analysis of logistics systems. All simulations discussed later have been executed 
with AMSEL. 

There are (at least) two ways how simulation is used. For a system (such as the transportation system 
within a factory) to be designed, one sets up a simulation model that refiects the basic properties of 
the system to be built. Making various assumptions about the demands of this system one designs a 
number of “reasonable” or even “extreme” scenarios to see how the system behaves, whether waiting 
times, throughput times, etc. are acceptable, and if not, one modifies the system so that the expected 
standards can be achieved. Once such systems are built they often show different than the expected 
behavior. Then, instead of rebuilding or technically changing the system, one tries to optimize some 
or all of the components of the systems to achieve better results. Here online algorithms come into 
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play. Typically several online algorithms are coded and, using real data of the past, their practical 
performance is analyzed by simulating the system where the new online algorithms replace the old 
procedures. Such comparisons, based on real system data, are used to judge whether the use of 
online algorithms pays and, if so, which algorithms to choose for the use in practice. 

We regret that we still use this old fashion trial-and-error approach. It seems, however, that we have 
to, at least at present, since we know of no mathematical tools that could give us better guidance 
for real-world decisions. 

3. Modeling Issues 

We would like to outline, by means of a few examples, modeling issues that come up when online 
problems are analyzed mathematically. 

Let us begin with one of the most studied problems in the “online world” : bin packing. We assume 
here that we have a (potentially unlimited) number of bins of size a. Moreover, there is a sequence 
of items of size aj,0 < a* < a, that have to be packed into the bins. The goal is to use as few bins 
as possible. (This problem in AfV-haid.) 

In the online problem we have to pack an item into a bin as soon as it arrives (and before the size of 
the next item is known). See, among others, [18] for a survey on online bin packing. In the standard 
offline problem one assumes that the whole sequence a = (ai, . . . , Un) of item sizes is given and that 
these have to be packed optimally. Let Copt (c^) denote the minimum number of bins needed. We 
would like to know how online bin packing algorithms behave with respect to this offline adversary. 
We first address lower bounds, i.e., we ask whether there is a constant ci such that no online algorithm 
can guarantee a competitiveness of c/ or below. 

Here is a simple observation. Let A be any online algorithm for the bin packing problem. Let us try 
to find a c/ such that 

Ca(<j) < Q • Copt(c^) 

cannot be guaranteed. We choose some positive number h. Let a = 2hhe the bin size, and consider 
the sequence /i — e, /i — €, /i -f €, h -h e, (where e > 0 is an arbitrarily small constant) of four item sizes. 
A must put the first item into bin 1. If A puts item 2 into bin 2 then the adversary stops. A has 
used 2 bins, the optimum is 1 bin and hence c/ > 2. To achieve a better competitiveness A must 
put item 2 into bin 1. But then items 3 and 4 must go into bins 2 and 3. Hence, Ca(ct) = 3 and 
Copt(c^) = 2, which shows c/ > 1.5. 

Taking now also the additive constant into account and using input sequences consisting of n items 
of size h — e and n items of size h-\- e one can similarly show that 

Ca(<7) < Cl • Copj{a) H- b 

cannot be guaranteed for q = | by any online algorithm whatever choice of the constant 6 is used. 
Using very clever refinements of this simple input sequence (and further ideas) this lower bound has 
been improved by various authors. The current best lower bound is q = 1.540, see [26]. 

Let us now look at the most trivial online algorithm for bin packing one can think of: NEXT FIT (NF). 
NF has an active bin (initially this is the first bin). NF tries to put a new item into the active bin. If 
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it does not fit, NF closes this bin and puts the item into a new (now active) bin. Let bi,i = 1, . . . , A;, 
denote the sum of the item sizes packed into bin i at termination. Then we have 6, 4- bi+i > a which 
immediately implies that Cuf{a) < 2 • Copj{a) for all sequences a. 

A little more elaborate are FIRST FIT (FF) and BEST FIT (BF). FF and BF keep all bins active; FF 
puts the next item into the first bin into which it fits, while BF puts the item into one of the bins 
into which it fits and that makes the bin as full as possible. A similarly trivial analysis shows that 
FF and BF are at least 2-competitive. With a lot of technical detail ci = 1.7 can be established. In 
fact, a competitive ratio of at least 1.7 can be shown for any online algorithm that never opens a new 
bin unless the current item does not fit into any of the open bins. These algorithms can be refined. 
The current online champion (a variant of the modified HARMONIC algorithm) achieves ci = 1.5887, 
see [25]. Hence, upper and lower bounds for the competitiveness of online algorithms for bin packing 
are quite close, but not identical. 

In general, it is often not clear what a suitable offline problem is. In the case of bin packing one 
may have doubts whether always an unbounded number of bins can be kept available for the online 
algorithm to choose the best fitting bin, for instance. Therefore, the bounded-space online bin packing 
problem has received some attention. Here at most k {k given in the beginning) bins are kept active 
and - in contrast to the ordinary bin packing problem - one is allowed to repack the active bins. Lee 
and Lee [24] proved that, for any fixed /c, no online algorithm can be better than 1.69103-competitive. 
Galambos and Woeginger [17] designed a (polynomial time) online algorithm that achieves this 
bound, in particular, for A; = 3. Thus, for the bounded-space online bin packing problem, there is an 
algorithm that is optimal with respect to the achievable competitiveness. Moreover, the algorithm 
needs only three active bins for repacking. Keeping more bins open for repacking does not pay - at 
least within this theoretical framework. 

Results of this type are rare. Moreover, in practice the mathematical models to consider are not so 
clear cut as suggested above. The typical issue is that there is a continuing process where, from time 
to time, decisions have to be made that influence various parameters of the process. The person 
controlling or in charge of the process is not “happy” with the performance of the system. The 
question is what can and what should be done, and if the decision process (the online algorithm 
controlling the system) has been changed, in what respects is the new solution better or worse than 
the old one? 

We call this the “modeling issue” and will explain in the subsequent sections what we mean by this. 
The point is that, in general, there is not a single objective that one has to look at; practical online 
environments are complex with many alternatives for success or performance evaluation. 

A second important issue, often overlooked, is the choice of the adversary with which the performance 
of an online algorithm should be compared. The point here is that the choice of an adversary that is 
too “strong” may produce a grossly pessimistic judgement of the quality of an online algorithm. We 
will discuss this issue in detail with respect to online stacker crane optimization where the asymmetric 
traveling salesman problem (ATSP), the ATSP with precedence constraints, and the ATSP with time 
windows are possible adversaries and provide rather different pictures of the performance of our online 
algorithm, see Section 6. 
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A third issue, already indicated above, is the fact that an online problem may not necessarily have a 
natural offline version. This fact displays again in our stacker crane application, see Section 6. Here 
a stacker crane moves objects within an automatic storage system. Moves requested may depend on 
the moves the stacker crane has executed previously. Thus, in this case, there is no offline sequence 
of requests that could be given beforehand. 

4. Online Control of Elevator Systems 

4.1 Problem Description 

The automated pallet transportation system in the distribution center of Herlitz PBS AG [21] (one 
of Europe’s largest producer and distributor of office supply) has to take care of the ffow of pallets 
from/to the receiving docks, the production and commissioning departments, the automated storage 
system, and the delivery docks. This pallet transportation network connects nine factory floors. 
Among the building blocks for pallet transportation are two systems with five elevators. Each 
elevator can carry one pallet. 

In principle, the control system of the transportation network has the task to move items as fast 
as possible without congestion, to observe time windows, due dates and other technical details. All 
this has to happen in real-time. The answer times have to be below one second. There is no clear 
objective that one could formulate mathematically. Nevertheless, it is important that each individual 
component of the transportation system works efficiently. ,We are currently investigating the elevator 
system. How should the elevators move in such an environment? 

A typical approach would be to schedule the transportation requests in such a way that either the 
average or the maximum waiting time (or a combination of the two) is as short as possible. It is easy 
to show that, for these two objectives, there does not exist a c-competitive online elevator scheduling 
algorithm for any constant c. Hence, we decided to look at minimum completion time (i.e., the total 
time needed to serve all requests) for competitive analysis. 

4.2 Mathematical Model and Theoretical Results 

We will start with a more general problem than needed for our elevator application. An instance 
of the online transportation problem OLTP consists of an undirected graph G = (F, E) with edge- 
weights d{e) (e G E) and a distinguished origin vertex o eV. We are also given an integer C > 1 
which specifies the capacity of the server (elevator, vehicle, etc.). In all what follows here the capacity 
will be C = 1. Finally, one is given a sequence = (ri, . . . , r^) of transportation requests. 

In the OLTP each request is a triple ri - (t*,ai,6i), where U is a real number, the time where 
request r, becomes known, and ai e V and bi e V are the source and target of the transportation 
task. It is assumed that the requests in the sequence a are ordered according to their release times 
i.e., that ti < • • • < The server can move at constant unit speed and is located at the start o at 
time 0. 

Given a sequence a of requests, a transportation schedule is said to be valid for cr, if the following 
conditions are satisfied: (a) The server starts its movement in the origin vertex o and, (b) each 
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transportation request in a is served, starting not earlier than the time it becomes known. 

In the sequel let Cf^(a) denote the completion time of the server controlled by algorithm A on the 
input sequence a. We also use OPT to denote the optimal offline algorithm. 

Observation. No deterministic online algorithm for OLTP has a competitive ratio better than 2 
(with respect to completion time). 

Proof: The underlying graph G = {V, E) for the instance of OLTP consists of a path of 2n + 1 
vertices with the origin being the middle vertex at distance of n from each end. More formally, we 
have that V = {vi \ i = -n, . . . , n } with o := vq and E = { ViVi+i \ i = -n, . . . , n - 1 }. All 
edge- weights are equal to 1. 

Assume that A is an algorithm with competitive ratio c < 2. We choose the number n so large that 
2 > 2 - I > c. 

The first request given is ro = (to = 0, ^-i)- We now claim that at time ti = n - 1 the online-server 

cannot be strictly to the right of vertex ui, i.e., on the path from vi to Vn but not on Vi. In fact, if 
the server were to the right, say at distance 5 > 0 to the right of vertex vi, then we could add the 
request ri = (n - 1, i), v^n)- The online server would then need time at least 1 + (5 + n to serve 

the request. This would result in a total time of 2n + 6. On the other hand, the offline server could 
serve ro starting at time to = 0 and then continue to move to the left until it reaches vertex 
at time n — 1 ready to serve the new request ri. Thus, the offline server needs time n and thus 
C'a((^o,^i))/C'opt((^o,’"i)) = + S)/n > 2, which means that A would not be 2-competitive. 

We have seen that at time ti = n — 1 the online server is to the left of vertex vi . We now add the 
request r^ = (n — 1, Vn- 2 i The total time needed by the online server is then at least n — l-h 

(n — 2) = 2n — 3. 

On the other hand, the offline server serves the sequence (ri,r 2 ) by handling ri at time to and then 
immediately moving to vertex Vn -2 which it reaches at time n — 1 ready to serve r^. Thus, we 
have that CoPT((n,^ 2 )) = ^ind CA((ri, r^)/CoPT((n, = 2 - 3/n > c. This contradicts the 
assumption that A is c-competitive with c < 2. □ 

A natural approach for the design of online algorithms is the following REPLAN-strategy. We for- 
mulate it here for the OLTP. However, the REPLAN-strategy is more of a general principle than of 
a specialized algorithm for a particular problem. 

Strategy REPLAN: Whenever a new request arrives, the server completes the current transportation 
move (if it is performing one), then the server stops and does a replan: it computes a new shortest 
transportation sequence starting at the current position of the server and containing all yet unserved 
requests. 

Observation. REPLAN is a ‘^-competitive online- algorithm for OLTP. 

Proof: Let a = (ri, . . . ,r„i) be any sequence of requests. We consider the time tm, when the last 
request rm = (^m,am,^m) becomes known. Let s{tm) denote the position of the REPLAN-server at 
time tm- 

Case 1: The server is not performing a carrying move at time tm^ 
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Figure 1: Screenshot of the simulation system for elevator systems 



The REPLAN server computes an optimum transportation sequence S which starts at its current 
position s{tm) and serves all unserved requests. The time needed for 5 is at most d(s{tm),o) + 
Copt (c’’) 5 since all the requests from a are available for planning at time tm and the server could 
move to the origin o in time d{s{tm),o) and from this point on follow the tour of the optimum 
offline server OPT. Thus, REPLAN will finish its work no later than tm + d{s{tm),o) + Copt(ct). 
Since the REPLAN-server has traveled to point s{tm) there must be a request r = (t, a, b) in a with 
max{d(o,a),d(o,6)} > d{o,s(tm)) from which we conclude that d{s{tm),o)) < Copt(ct). Finally, we 
can use that the offline server can never complete its transportation before time tm, i.e., when the 
last request becomes known. This gives us that Creplan(<^) ^ 3 • Copt(<^)- 
Case 2: REPLAN is performing a carrying move from a to 6 at time tm- 

The server will finish its move at time tm + d(s{tm),b). It is easy to see that the shortest schedule 
starting at b serving all yet unserved requests has length at most Copt(o-) - d(a, b) + d{b\ o), where 
b' is the endpoint of the route taken by the optimum offline OPT. Thus, 

Creplan(<^) ^ + d(s{tm), b) + Copt(c^) — d(o, b) + d{b' , o) 

< tm-^ d{s{tm),b) + Copt(^t) - rf(a, b) + d(6', 6) + d(a, s(tm)) 

< tm -h Copt (o') + d(b\ o) 

< 3'Copt(o')- 

This completes the proof of the 3-competitiveness of REPLAN. □ 

No competitiveness results are known for elevators of capacity larger than one or systems of more 
than one elevator. Further investigations can be found in [10, 14, 19, 20]. 

4.3 Simulation Results and Practical Impact 

For the OLTP we have set-up a parameterized simulation system based on AMSEL [4] (see Figure 1). 
Several algorithms were tested on random data, among them the simple FIRST FIT heuristic. This 
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strategy always serves the closest available request first and hence does not organize at all the 
transportations in advance. This strategy is one of those used in the distribution center of Berlitz. 
The performance was measured not only with respect to the completion time but also to the average 
and maximal flow time. This is the average resp. the maximum time between the time a request 
becomes known and its completion time. 

Although REPLAN is competitive with respect to completion time and FIRST FIT is not known to be, 
both algorithms have almost identical completion times. REPLAN is considerably better than FIRST 
FIT with respect to maximum flow time (about 20% on the average) while FIRST FIT beats REPLAN 
slightly concerning average flow time (about 5%), see Table 1 for some simulation runs corroborating 
these observations. 
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Table 1: Simulation results for OLTP on random data 



The next step in our project is to record real-world data from the Berlitz system in order to validate 
the simulation system and to And out whether any particular structure in the problem data leads to 
special results differing from those obtained by using random data. 

5. Routing of Commissioning Vehicles 

5.1 Problem Description 

In one of the departments of the Berlitz distribution center greeting cards are commissioned. The 
cards (about 4000 different types) are stored in four parallel shelving systems, see Figure 2. In 
accordance with the customers’ orders, the different greeting cards are collected in boxes that are 
eventually shipped to the recipient. Order pickers on eight automated guided vehicles collect the 
orders from the storage systems, following a circular course. The vehicles are unable to pass each 
other. Moreover, due to security reasons, only two vehicles are allowed to be in the middle aisles at 
the same time, whereas three are allowed in the first and last aisle, see Figure 2. (Three vehicles line 
up in front of aisle 2 since they are not allowed to enter.) 

At the loading zone (where vehicle 8 is located in Figure 2), each vehicle is assigned up to 19 orders 
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Figure 2: Screenshot of simulation system for commissioning mobiles 



from an order pool that changes whenever a software system (operating on a higher level) releases a 
new set of orders. A dispatcher decides when to send the vehicle onto the course. After leaving the 
loading zone the vehicles automatically stop at the positions where cards have to be picked from the 
shelf. Each stop position holds up to ten different types of cards. 

There is no straightforward single objective for the optimization of the greeting card commissioning 
system. Each greeting card order is, in general, part of a larger order including, for example, orders 
for pencils, writing pads, staplers, and other stationary. All the parts of an order have to reach a 
specified destination in the delivery zone within a certain time window. The overall order processing 
is organized in a just-in-time fashion. For this reason the greeting cards commissioning has to observe 
certain due dates. On the average, more than two thirds of the orders processed during a day have 
been generated during this day. 

The Herlitz system analysts observed severe deficiencies in their current scheduling system. Con- 
siderable congestion occurred resulting in a violation of due dates. Congestion was viewed as a 
consequence of an “unbalanced” vehicle load which caused a “fast” vehicle (carrying orders with 
only a few stop positions) to wait behind a “slow” one. Congestion was further increased by “human 
factors” . Order pickers waiting in a queue left their vehicles to smoke a cigarette etc. and forgot to 
return in time. It was decided to come up with a new scheduling system that minimizes completion 
time, obeys the due dates and avoids congestion. (For details see [23].) 

5.2 Mathematical Model and Theoretical Results 

For the theoretical analysis it is necessary to provide a mathematical formulation of the problem 
under consideration. We have started analyzing the Herlitz system by considering the following 
Commissioning Vehicle Problem (CVP). 

An instance of CVP consists of a set L = {1,2, ...,A’} of possible stop positions, and a set of 
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empty vehicles v\^. . . each with capacity C. A request sequence a = (ri,r 2 , . . .) consists of a 
chronologically ordered collection of sets of stop positions. 

For a sequence of requests, a solution to the CVP is an assignment for every request to a vehicle 
v{ri) so that the number of requests assigned to each vehicle does not exceed C. The objective is 
to minimize the total number of stop positions assigned to the vehicles. In Kamin [23] it was shown 
that the offline version of CVP is an A/*P-hard problem. Explicit solving of the integer programming 
formulation in reasonable time turned out to be out of reach for commercial software packages. 

In the online-situation we require that request r* is permanently assigned to vehicle v{ri) before r^+i 
becomes known and that the length of the sequence is unknown until the last request comes in. 

Observation. For every request sequence a = (ri, . . . any assignment of requests to vehicles 
needs at most C times as many stops as an optimum offline assignment. 

Proof. Each request rj requires |ri| stops. If ly is the set of (at most C) requests assigned to a 
vehicle v then, clearly, v stops at at most 5Z kt| positions. Hence, for every request sequence and 

Tieiv 

m 

any assignment, the total number of stops is bounded from above by X) k»|- On the other hand, a 

1=1 

vehicle v has to stop at at least |rj| positions. Since C is the capacity of the vehicle, this value 
is at least as large as E This holds, in particular, also for the optimum offline assignment. 

Ti^Iv 

Summing up over all vehicles yields the observation. □ 

This result is due to Kamin [23], who also constructed a request sequence on which every online algo- 
rithm needs at least C times as many stop positions as the offline optimum. These two observations 
yield: 

Corollary. Every online algorithm for CVP is exactly C -competitive. □ 

We view this as a truly strange result which displays the weakness of competitive analysis in certain 
situations. Competitive analysis is unable, in the case of CVP, to distinguish between different online 
algorithms. On the other hand, it is quite obvious that a BEST FIT algorithm will perform much 
better in practice than a “WORST FIT” algorithm or a random assignment. 

5.3 Actual Approach, Simulation Results, and Practical Impact 

Several heuristics that reduce the total number of stops and distribute them evenly among the vehicles 
were implemented. These are versions of the BEST FIT and the GREEDY algorithm, together with 
local exchange heuristics. The computation times of these algorithms are short enough to run in a 
real-time situation. The real-time requirements in this special problem, however, are by far not as 
strict as in the OLTP. 

We implemented a detailed simulation model for the whole commissioning area in which we compared 
our approach to the one used so far. Herlitz provided production data from a period of about six 
weeks, which were the basis for the comparison. The main results are the following: 

• The number of stops of the commissioning vehicles could be reduced by about 6% on the 
average compared to the solution that has been used at Herlitz. 
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Figure 3: Sketch of the factory layout 

• A tour of a commissioning vehicle at Herlitz took about 70 minutes including about 5 minutes 
time spent in congestion. This idle time could be reduced to 40 seconds on the average. 

• Due to a detailed congestion analysis and prediction, changes in the dispatching habits (in 
addition to minimizing the number of stops) the completion times could be reduced significantly. 
On the average, the work load of a day (two shifts of 8 hours) could be completed about 5 
hours earlier. 

• The number of vehicles (and therefore order pickers) could be reduced from eight to six without 
any negative impact on the system performance. 

We conclude that our mix of heuristics — although not distinguished in the competitive analysis — 
whose choice was based on simulation experiments, was the basis for significant improvements in 
practice. The simulation results convinced Herlitz to test a prototype of the simulation program as 
a decision support tool for the dispatcher. 

6. Stacker Crane Routing in an Automatic Storage System 

6.1 Problem Description 

Siemens Nixdorf Informationssysteme AG (SNI) maintains a production plant where all SNI personal 
computers and related products are assembled. All parts are stored in one of six automatic storage 
systems (AUSSs). The AUSSs serve as material buffers between the receiving area and the assembly 
lines located at each side of the AUSS. For each AUSS there is one stacker crane fulfilling transporta- 
tion tasks between the receiving buffer, the storage locations, and the buffers for the assembly line. 
Figure 3 shows a sketch of the factory layout. (For a more detailed description of the layout, see [5].) 
It is, of course, desired that the stacker crane works “efiftciently” . No problems arise at times of low 
work load where the stacker crane typically has to choose which of at most three jobs to perform 
first. However, there are times of heavy work load, e.g., after a system breakdown, when up to 60 
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2 


3 


4 


5 


ATSP°*^^*"® 


11.50 


14.66 


11.33 


10.00 


11.92 


ATSP 


7.36 


6.64 


6.38 


5.89 


8.14 


ATSPP*^ 


8.18 


7.40 


8.10 


7.46 


9.59 


ATSPTW 


8.45 


[8.6,11.3] 


[9.2,11.3] 


[9.7,10.0] 


[10.4,11.4] 


GAP : 


ATSP 


56.2% 


120.7% 


77.5% 


69.7% 


46.4% 


ATSPPR 


40.6% 


98.1% 


39.5% 


34.0% 


24.3% 


ATSPTW 


36.1% 


71.1% 


22.6% 


3.1% 


14.9% 



Table 2: Average unloaded travel time in seconds and gap for various adversaries 



transportation tasks have piled up. To get the production line working, the requested items have 
to be moved fast and it has to be made sure that certain jobs have to be executed within given 
time windows. After a detailed analysis it was decided to minimize the unloaded travel time of the 
stacker crane while observing certain priority rules. The stacker crane is supposed to work whenever 
a transportation task is available. 

6.2 Mathematical Model and Theoretical Results 

An offline version of the stacker crane problem that seems “natural” at a first glance is the asymmetric 
traveling salesman problem (ATSP). An instance of ATSP consists of a complete directed graph 
D = (V, A). Each node in V corresponds to a transportation task, and the arc from v tow has weight 
corresponding to the travel time from the end point of task v to the starting point of task w. No 
polynomial time heuristic for the ATSP is known that has a constant quality guarantee. Similarly, no 
competitive online algorithm is known. (This contrasts with the situation for the symmetric traveling 
salesman problem, where, e.g., REPLAN is known to be 2.5-competitive [11, 12].) To overcome this 
gap we decided to invoke a-posteriori analysis which can be viewed as an instance-based competitive 
analysis. 

We implemented several online algorithms (the AMSEL screenshot in Figure 4 shows the execution 
of three algorithms) and compared them to the optimum ATSP solution. The gaps were huge. 
Analyzing the ATSP solution we observed that some of the jobs created in the evening were executed 
in the morning and that some transportation tasks were performed before the items to be moved 
were actually available (as a result of a preceding move). This led us to consider more restricted 
versions of the ATSP. We introduced time windows and precedence constraints. (In fact, not much 
research had been spent on these ATSP versions, and significant work had to be invested to design 
and implement algorithms that can solve these problems to optimality for the range of instance sizes 
that come up in this application, see [5, 6, 7, 9]). Table 2 shows the gaps between our online solution 
and the three adversaries ATSP, ATSP with precedence constraints, and ATSP with time windows 
for some of the instances that came up at SNI. Clearly, a change of the offline adversary significantly 
changes our view of the performance of the online algorithm. Consider for instance problem 4 of 
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Figure 4: Screenshot of simulation system for automatic storage systems 



Table 2. The online solution needs about 70% more time than the ATSP solution, 34% more time 
than the ATSP solution when precedence constraints are required and only 3.1% more time if time 
windows are observed where the latter deviation is only an upper bound on the gap since this instance 
could not be solved to optimality (9.7 is a lower and 10.0 an upper bound on the optimum ATSP^^ 
value). 

The stacker crane problem displays a difficulty that frequently occurs in complex transportation 
problems. There is no “canonical” offline problem for the given online problem. For instance, the 
SNI logistics system is governed by certain rules that influence the request sequence for stacker 
crane moves, and in turn, the behavior of the stacker crane influences the other modules of the 
transportation system. For example, the input buffer of the AUSS is served by automatically guided 
vehicles. At most two items can be in the buffer. A vehicle may stand in front of the buffer carrying a 
further item for the AUSS. The transportation request for this item, however, is only generated when 
the item is moved into the buffer. On the other hand, if the waiting vehicle blocks other vehicles then 
a rule says that the vehicle must go away after 60 seconds. In general, it will make a turn and return 
to the buffer later on. This process shuffles the request sequence and leads to different generation 
times, time windows, etc. Therefore, none of our three offline adversaries is guaranteed to produce 
a sequence of stacker crane moves that can actually be executed in practice. 

This short discussion indicates that finding an appropriate offline model of an online problem is an 
unresolved issue. 

6.3 Actual Approach, Simulation Results, and Practical Impact 

Here we describe what we did in practice. Although the problem is known to be A/P-hard there are 
codes available so that the REPLAN heuristics can be used in real-time situations. To be on the safe 
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side, we implemented a three phases optimization process: 

Phase 1: Perform cheapest insertion of the new task. 

Phase 2: Run an iterative cheapest insertion on a random ordering of the current request 

sequence. Then pick the winner of Phase 1 and 2. 

Phase 3: Solve the ATSP with the current transportation tasks to optimality 

(branch&bound from [16]) and replace the old sequence completely by the optimal 
one (REPLAN). 

Phase 1 runs in time linear in the number of requests and is always completed. For the typical 
problem sizes that occur in our application (no. of requests is less than 60) this is done in fractions 
of a second. Even Phase 3 was always completed within a few seconds. If the stacker crane becomes 
idle, the optimization is interrupted, and the best sequence so far is passed to the stacker crane. 
Although one can construct request sequences where REPLAN does not give the best result among 
all online algorithms we implemented, our simulation experience showed that REPLAN empirically 
provides the best results on average. 

SNI provided data for one week of production. During this period, for one AUSS, each generated 
task and each move of the stacker crane was recorded. It turned out that in heavy load periods the 
times needed for unloaded moves could be reduced by approx. 30%. 

As a result the optimization package was put in use, and the results were confirmed in everyday 
production. Even in the production environment the optimization process could almost always finish 
with Phase 3. 

7. Conclusion 

We hope that we could demonstrate that combinatorial online optimization is an exciting area of 
operations research with practical relevance and impact. The theory so far developed is far from 
being satisfactory and being able to guide decisions in practice. However, the use of theory-based 
problem specific heuristics can improve the performance of “online systems” in practice. We have 
discussed here only relatively simple modules of much more complex logistics systems. The analy- 
sis and performance improvement of large systems is the “real goal” and requires further research 
efforts. In particular, it is not really clear at present how complex systems should be modeled, 
what the real objectives, constraints, etc. and what the interdependencies between them are. To 
put it in a nutshell, we do not know how to answer the following two questions convincingly: 

What is a good system? 

What is a good system performance ? 
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Summary: The purpose of this paper is to report on some experiments concerning the numerical 
solutions of a class of dynamic Cournot models. In doing so we shall put in evidence the importance 
of the assumption of strict diagonal concavity of the payoff functions in the numerical analysis of 
dynamical imperfect competition models. 



1. Introduction 

The Cournot equilibrium [8] is a cornerstone of modern economic theory [10]. It has had a very 
profound influence on the theory of industrial organization [20]. Operations research, which is a field 
traditionally concerned with the numerical solution of microeconomic models, has already contributed 
several approaches for the computation of static or dynamic equilibria. We may refer to [21] for an 
algorithm, based on a model specific primal-dual iteration, to solve the classical Cournot model. A 
more general class of algorithms is related to the theory of variational inequalities (VI see [9], [18]). 
To guarantee convergence of the Vl-based algorithms one usually requires a monotonicity property 
to hold. This property has also been used by Rosen [22], under the name of strict diagonal concavity 
(SDC) to prove uniqueness of equilibrium and convergence of a gradient-like algorithm in a class of 
concave games. The introduction of time and uncertainty in the Cournot paradigm confronts the 
modeler with a whole variety of information structures with varying degrees of difficulty to obtain a 
numerical solution. In the theory of sequential noncooperative game, with an infinite time horizon, 
there is no general convergence result concerning the numerical computation of subgame perfect 
Markov equilibrium strategies as indicated in [3]. In [14], an information structure called 5-adapted, 
has been proposed for a discrete time duopoly model of the gas market. This information structure 
permitted the use of stochastic programming technique to compute an equilibrium adapted to a set 
of random scenarios represented on an event tree. The extension of this concept to a differential 
game setting has still to be made. 

In the present paper we revisit the duopoly differential game model under the three information 
structures of open-loop, feedback and 5-adapted strategies and we explore the role played by the as- 
sumption of strict diagonal concavity in the existence of an equilibrium and convergence of numerical 
algorithms. 
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2. Cournot equilibrium 

2.1 The static Cournot model 

Let us first of all recall the basic Cournot oligopoly model. We consider a single market on which 
m firms are competing. The market is characterized by its (inverse) demand law p = D{Q) where 
p is the market clearing price and Q = Qj is fofsil supply of goods on the market. The 

firm j faces a cost of production Cj{qj)^ hence, letting q = represent the production 

decision vector of the m firms together, the profit of firm j is Trj(^) = qj D{Q) — Cj{qj). The following 
assumptions are placed on the model 

Assumption 1 The market demand and the firms cost functions satisfy the following (i) The in- 
verse demand function is finite valued, nonnegative and defined for all Q 6 [0, oo) . It is also twice 
differentiable, with D'{Q) < 0 wherever D{Q) > 0. In addition D{0) > 0. (ii) Cj{qj) is defined for 
all qj € [0, cxd), nonnegative, convex, twice continuously differentiable, and C\qj) > 0. (Hi) QD{Q) 
is bounded and strictly concave for all Q such that D(Q) > 0. 



2.2 Computing the solution of a classical Cournot model 



One may propose three different approaches for finding a solution to a static Cournot game: (a) one 
discretizes the decision space and uses the Lemke-Howson [17] algorithm to compute a solution to the 
associated bimatrix game; (b) one exploits the fact that the players are uniquely coupled through 
the condition that Q = T>j=i,...,mQj ^ sort of primal-dual search technique is used (Murphy, 

Sherali, Soyster, [21]); (c) one formulates the Nash-Cournot equilibrium problem as a variational 
inequality. Assumption 1 implies Strict Diagonal Concavity (SDC), in the sense of Rosen [22], i.e. 
strict monotonicity of the pseudo-gradient operator, hence uniqueness and convergence of various 
gradient like algorithms. 

Let us recall the definition given by Rosen of strict diagonal concavity. In a general concave game 
formalism one assumes that each player j e M = {!,..., m} controls the action qj G Qj and 
gets a payoff . • • , , ^m)* The payoff ^j(-) of player j is supposed to be strictly concave 

w.r.t. the player’s own decision qj. A Nash (-Cournot) equilibrium is defined as an m-tuple q* = 



(^J“, . . . , . . . , g;;) € Q = Qi X . . . X such that, for each player j G M, . . . , q], 

■4>Mh • • • - 9;. • • • . 9m) for all q, € Qj. 

Given r = (rj)j=i,...,m > 0 define the pseudogradient of the weighted payoff function 



. 9 ;)> 



«r(q,r) = E,=i mr>j(q) as p(q,r) = 



n^V>i(q) 



r2^V'2(q) 



. The function cr(q, r) is strictly diagonally 



L J 

concave (SDC) for q G Q (when all the rj are equal we also say that the payoffs are SDC) if for every 
q°, q^ G Q we have 

(q^ - q‘’)' 9 (q",r) + (q“ - q^)'g(q\r) > 0. 



A sufficient condition that cr(q, r) be diagonally strictly concave is that the symmetric matrix 
[G(q, r) -h G(q, r)'] be negative definite for q G Q, where G(q, r) is the Jacobian with respect to 
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q of ^(q, r). Indeed this definition of SDC is closely related to the assumption of monotonicity in 
the theory of variational inequalities as it will be shown later on. 



3. Dynamizing Cournot 

3.1 Production capacities as strategic variables 



Brock [4] has proposed a dynamic Cournot game format that is very close to the neoclassical economic 
growth model. Each firm j produces from the use of a physical capital kj{t). The capital accumulation 
of firm j is described by ^kj{t) = Uj{t) — fijkj(t), Aij(O) = kj, where fij is the capital depreciation 
rate and Uj{t) is the investment rate for firm j. In a simplified formulation we assume that the 
production is proportional to the capital stock; without loss of generality we may assume that the 
total supply on the market at time t is given by K{t) = kj{t). The market is characterized 
by the usual (inverse) demand law defining the market clearing price in function of the total supply 
7r{t) = kj{t)). Now the payoff to firm j is defined by 

roo 

•'0 j=i 



where is firm j cost function for holding the capital stock and investing (adjustment cost). 

One allows a zero discount rate p and one uses then the overtaking optimality concept (see [5]). 

In this formulation one notices the linearity of the state equations and the fact that they are com- 
pletely decoupled, each firm controlling only its own capital accumulation process. If one assumes 
a linear demand law tt = D{K) = ao - ^/3 q Ei=i,...,m with ao > 0, /?o > 0, and a quadratic cost, 
9 j(kj,Uj) = ajikj -f \Pj\{kjY + aj 2 Uj H- l/3j2{uj)‘^ where > 0, /3ji > 0, aj 2 > 0, /3j2 > 0, then the 
payoff rate of player j becomes 

Lj(k, Uj) = kj[cko — —pQ ^ /ci) — {oijikj 4- '^Pjiikj) 4 ocj2'^j + 

^ 1=1,. ..,m ^ ^ 



which can be rewritten, after rearranging terms 



LjiK Uj) = \ k^Qjk + hjk - 



( 0 



with hj^ = ( 0 ... ao — Uji ... 0 ) and Qj = 



-lA. 



0 \ 



-\Po ■■■ -(/?0 + A'l) ••• “oAo 



V 0 



( 2 ) 



. Indeed 






the dynamics is linear for each player. The strict diagonal concavity w.r.t. k of the instantaneous 
payoffs is implied by the negative definiteness of the matrix 

( ~{Po + Pn) • • • ~ 2 ^o 



Q = 



V -lA) 



~(^0 + /? jl ) •. 



-\Po 






— {Po + Pml) ) 
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In the duopoly case we obtain Qi = 
~\Pq — (A + A2) 



-(/^O + Al) -\0Q 

- 5 A 0 



and h/ = ( ao - an 0 ), 



Q 2 = 



0 






Yhr =(0 ao-a,3)andQ=f-(^“+/“) V The 

/ \ ~2^0 ~(A + A2) / 



negative definiteness of the payoffs will be obtained if (/?o + Ai)(/?o + P12) > which is always the 
case. 



4. Existence, uniqueness and stability of dynamic open-loop equilibrium 

In [5] a complete theory of existence, uniqueness and asymptotic stability (turnpike property) has 
been developed for the above model. Assumption 1 suffices to obtain this nice result. Actually, Carl- 
son and Haurie have considered a general class of concave games with decoupled dynamics, i.e where 
each player controls his/her sate equation Xj = fj{xj,uj) (e.g. the duopoly model with capacities as 
strategic variables), and receives a payoff according to a rate Lj{x{t),Uj(t)), where x = 

The result is based on the use of a Lyapunov function F(x, p) = - ~ PYi^j ~ in the 

phase space (x, p), where pj is the costate variable associated with player fs state variable xj in the 
hamiltonian and (x,p) is the unique steady-state solution of the ” pseudo” -hamiltonian system char- 
acterizing open-loop equilibrium solutions. The fundamental Lyapunov property jiV{x{t),p{t)) < 0 
results then from the strict diagonal concavity in x, convexity in p of the hamiltonians. We refer the 
reader to [5] for a detailed presentation that cannot be repeated here due to space limitation. The 
turnpike property is useful for the implementation of a numerical technique based on the solution of 
a Two Point Boundary Value problem (maximum principle), see e.g. [ 11 ]. 

4.1 Variational inequalities 

One may also use a direct time discretization approach to approximate the solution of an open-loop 
game by a solution of a variational inequality. We shall continue to use the control theoretic notation 
of X for the state (capital stock) variable and u for the control (investment) variable. We also use a 
quadratic holding and adjustment cost. 

4.1.1 Time discretization 

We consider here a duopoly game with a finite time horizon T. Define the sampled times t*. e T = 
{0, 6,2S,...,{K- 1)5, T} With 6 = T/K. Let x, = and u, = (n^(ik))f=, = 

Then one replaces the differential state equations with the difference equations 

Xj{k) = Uj{k)S + (1 - HjS)xj{k - 1) = (1 - + ar®(l - iijSf 

1=1 

4.1.2 Profit function for player j 

It is now possible to write the payoffs as functions of the whole history of the control variables only. 
Jj(ui,U 2 ) = ^e~'^'‘{D{xi{k),X2(k))xj{k)-{xj{k)f-{uj(k)f}6. 

k=l 




4.1.3 Open-loop Nash equilibrium of the discrete-time game 

Let u* = (uJ,U 2 )) be a solution of the following variational inequality: 



45 



{F{u*),u* -u)>0 WueUixU 2 

With u = [ I and F{u) = ( \ payoff functions implies strict 

V / V ^U3«/2(Ui,U2) J 

monotonicity of the operator F{u) and hence existence, uniqueness and convergence of gradient- 
like algorithms. This solution will be a converging approximation, when (5 -> 0, of the open-loop 
equilibrium of the dynamic Cournot game with capacities as strategic variables. 



5. Feedback information structure 

The open-loop equilibrium is not subgame perfect, in the sense of Selten [19], whereas the Nash- 
feedback solution is subgame perfect. It would be nice if the SDC property proved to be as helpful 
in the feedback information structure as it is in the open-loop case. Unfortunately the results are 
not totally satisfactory. 



5.1 The coupled dynamic programming equations 

The HJB equations for an m-player differential game with a state dynamics x = f{x,u\, . . . ,Um), 
Uj e Uj, is obtained when one considers the value functions 

fT 

Vj{t,x) = J Lj(s,x{s),u{{s,x{s)), ... ^u^{s,x{s))) ds with x{t) = X. 

Here we denote uj{t, x) G Uj a feedback law describing the strategy of Player j and 

Ui(t, X^j...jUj (t, x), . . . , x) 

a feedback m-tuple which is a Cournot-Nash equilibrium at any initial point (t, x). Under relatively 
restrictive assumptions of differentiability, these value functions should satisfy the following equations 

d d 

— Vj(t, j:) = ' ■ ' ’ • • • ’ (^) 

Vj{T,x) = 0 j = l,...,m. (4) 



As a particular case, assume a dynamics that is linear in ui, . . . , and a payoff that is quadratic 
in Uj and does not depend on Ui, i ^ j . Assume furthermore that there is a decoupling between 
state and controls. This means the following form for the payoff functions Lj(t,x,ui, . . . ,Um) = 
lj{x)-\-^ujRjUj+l3jUj and for the velocity /(x,iii,...,u„i) = Ax+Ej=i,...,m Now, applying the 
equilibrium conditions, we obtain Uj = —Rj^{l3j 4- -^Vj{t^x)Bj). This will define a local equilibrium, 
provided the matrix Rj is negative definite. Then, if one substitutes the expression of the local 
equilibrium in the HJB equation one obtains a set of coupled nonlinear partial differential equations 
in (t,x). 



= 9jix) + \WPj 



+ S-Vj{t,x)BjU^ + -l-Vj{t,x)(Ax- B,R-\l3i + ^Vi{t,x)Bi) 

1=1, ...,m 



dx 



dx 



Vj{T,x) = 0 j = l,.. 



( 5 ) 
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The numerical solution of the above equations still poses a formidable challenge to the OR and 
control professions. When the terminal time T tends to oo, the question of the asymptotic stability 
of the equilibrium arises. The conditions garanteeing this asymptotic global convergence are difficult 
to establish. 



5.2 A linear quadratic case 

We have seen above how the dynamic Cournot model could be formulated as a linear-quadratic 
differential game. We shall consider here a reduced version of the model reformulated in the classical 
'control theoretic format (cost minimization and no affine term in the cost). Another type of LQ 
Cournot game has already been discussed in [1]. 



5.2.1 Coupled Riccati equations 



Consider the particular case of two players j = 1,2 involved in a differential game defined by the 
state equations Xj{t) = AjXj{t) + BjUj{t)„ with a;(0) = and the payoffs 

Jf(x(-),«j(-)) = I [ {12 H ^k{tyQjM^e{t) + Uj(tyRjUj{i)}dt, (6) 

^ k=l,2t=l,2 

where Qjjj : rij x rij > 0 and Rj : rrij x rrij > 0 are supposed to be symmetric positive definite and 

QjM = Qj,tk- 



Assumption 2 The function <^(x) = Ej=i,2 Eib=i,2 
Ql,ll Q2,12 ] _j_ I ^1.11 Qhl2 

Qi,21 Q2,22 I \ <32,21 <32,22 



i.e the matrix Q = 



I3 /=i ,2 ^kQjykt^t strictly diagonally convex, 
^ is positive definite. 



Assume a quadratic form for the equilibrium value functions Vj{t,x) — | Eib=i,2 E/=i,2 
and let’s write the dynamic programming equation at equilibrium. The equilibrium feedback is 
defined by uj = Fjx = Ea=i ,2 ^jk^k where Fjj = and Fjk = -Rj^BjSjjk- The dynamic 

programming equations lead to the consideration of the following equations in 5i(t) and S 2 (t) also 
called the coupled Riccati equations (see [2] for details): 



-Si = Qi + 



•^Si 

Si{T) = 0 



F{iRiFii F{iRiFi2 
^11 -^12-^1 -^12 
Ai 4- BiFii B\Fi2 
B 2 F 21 A 2 + B 2 F 22 



( A-^Fi,B[ 

\ F{,B[ 



FiiB^2 



Si 



( 7 ) 



and 



—S 2 = <32 + 



+5*2 



^2(T) = 



^21 7^2 -^21 
F 22 B 2 F 21 



F 21 B 2 F 22 

F 22 R 2 F 22 



( A[ + F{,B[ 
\ F{,B[ 



Ai-hBiFii B1F12 

B 2 F 21 A 2 + B 2 F 22 



F2iB'2 

A '2 + F'BL 



S 2 



0 



(8) 
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where 

Fi = {-RT^B[Si,n, -R{^B[Si,n) = -Ri^B[Si, with Bi 
and 

F2 ~ ( — F>2 ^-^2*^2, 21) — F>2 ^■®2‘^2,22) “ -^2 ^-^2*^2 with B2 




0 

^2 



Introducing the notations A = 



Ai 0 
0 A 2 



,B = {BuB2)^Fi = {Fa,Fi2) i 



Ri = 



Ri 0 
0 0 



) R 2 = 



0 0 
0 R 2 



, we can rewrite equations (7) and (8) as 



1,2, F 



-Si = Q^-^F'RiF + {AABFySiASi{A + BF), 5i(T) = 0 

-52 = Q^-^F'R2F-\~{AABFyS2-^S2{A-^BF), 52(T) = 0. 



Fi 

F2 



(9) 

(10) 



If one considers an infinite time horizon then one looks for a stationary solution of the coupled Riccati 
equations, so we have to solve in Si, S 2 the following set of coupled Riccati equations 

0 = Qi-^F[RiFi-^{A-^BFySi-^Si{A~hBF) (11) 

0 = Q2 + i^2^2i^2 + (>l + 5F)'52 + S2(^ + 5F). (12) 



In order to get a solution to the infinite horizon problem one indeed needs an asymptotic stability 
property to hold. The Lyapunov function used in the open loop case to prove the global asymptotic 
property of equilibrium state-costate trajectories was defined as Ej=i, 2 Pj^j (with the steady state 
values normalized to 0). Remind that, under a differentiability assumption, pj = dvj{x)/dxj where 
Vj{x) is the open-loop value function for player j. Therefore it is natural, in the feedback case to 
consider the Lyapunov function ^x'Sx where the matrix S is defined as 



S = 



Sl,n 5'i,i2 I I ‘S' 2 , 2 / 

'S' 2,21 •S' 2,22 / \ Si^i 2 82,22 



(13) 



which is the symmetric quadratic form obtained from the expression | Z)j=i, 2 ^j The 

matrix S has been constructed from the matrices S\ and S 2 in a similar way the matrix Q is 
constructed from Qi and Q 2 - The SDC property says that Q is positive definite. It would be nice 
to obtain that S is also positive definite. One can check by direct calculation that S verifies the 
following differential equation 



~S = Q + 



4-5 



( FL 



RiFii 4- F22R2F21 



Ai 4- ^i-Fii 
B 2 F 21 



B 1 F 12 
A2 4- B2F22 



4- (5i - 52) 



«iFi2 \ Fi,B>2 Y 

J \ Fi2B[ A'2^F!22B'2 

( 0 BiFu \ / 0 -FI 21 BI 

V -B 2 F 21 0 ; V F[2B[ 0 



(5i - 52)'. 
(14) 



This equation has almost the form of a Riccati equation in S. The disturbing terms are those in 
{Si - S 2 ). They will prevent us from concluding that the SDC property for Qi and Q 2 will always 
translate into the positive definiteness of the matrix 5 and eventually into the appropriate property 
for the Lyapunov function x'Sx. However one may expect the positive definiteness of 5 to hold if the 
strict SDC holds with sufficient ’’curvature”, so as to impose the positive definiteness of 5, despite 
the terms in {Si — S 2 )’ 
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5.2.2 Proposed algorithm 

We propose here an algorithm for the solution of the algebraic Riccati equations of the linear quadratic 
duopoly game. It is inspired by the Kleinman [16] algorithm for optimal control and the Rosen 
pseudo-gradient algorithm [22] for concave games. 

• Find and F® such that Ai + is stable, 

• For A: = 1, 2, . . . 

Solve the linear system in 5'f , ^ 2 , 

r 0 = Qi H- F^'RiF^ -f- (.4 + BF^ySi + S^{A + BF’^) 
j 0 = O 2 + F^'R2F^ + {A BF^ySl + Sl{A + BF^) 

• Set 

(♦) = F^- e\UF^ + B'S'^), £ ( 0 , 1 ]. 

• If ||A*^|| = ||2(7^F*^ + B'S^)\\ < precision, end 

( Rk Rk \ 

1 pseudo-gradient of the pair of hamil- 

*^2,21 ^2,22 / 

tonians {x'H^x^xIH^x) where = Qi F^’RiF^ + (.4 + BFfS^ + S^A + BF^). Clearly this 
algorithm looks like a policy improvement method. It is well known that the policy improvement 
method which works finely for control problems does not extend without difficulties to dynamic 
games. Our intuition is that SDC should help in the convergence of this method. 

5.2.3 Numerical experiments 

In a first series of experiments reported below in Table 1, we have taken e” = 1. The results 
reported concern randomly generated problems with one dimensional state variable for each of the 
two players. They already show the importance of the SDC property of Qi, Q 2 to obtain convergence 
of the algorithm. However, as expected, the algorithm does not always converge, even when SDC 
holds for Qi,( 32- When the Q matrices are SDC and both Qj^jj and Qj^a are definite positive, the 
algorithm converged in 94% of the cases. The same situation with the Q matrices non SDC converged 
only in 75% of the cases. If the matrix Qjjj is positive definite but Qj^u is negative definite then the 
percentage of convergence drops to 84% when SDC holds and 45% when SDC does not hold. Here 
we see that the SDC property is not essential but is helpful in getting a stabilizing solution in the 
infinite horizon linear quadratic duopoly game, with feedback information structure. 

6. Stochastic duopoly models 

Stochasticity in oligopolistic competition can be modeled in various ways; for example differential 
equations describing capital accumulation or resource dynamics could be replaced with diffusion 
processes described by Ito equations; another possibility is to represent modal uncertainties affecting, 
for example, the demand law, as a stochastic jump processes (see e.g. [7, 13]). Indeed, in the most 
complicated version of the models the two types of uncertainties will appear in the description of the 
competitive system (see [12]). We shall consider here the information structures, called 5-adapted, 
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Conditions 


Convergence 


# iter. 


Qiyii ^ 0 
QiJj ^ ^ 
(Qi,Q2) sdc 


98% 


17 


Qi,» > 0 
Qi,ii ^ 0 

(Qi.Qa) Non SDC 


75% 


19 


QiM > 0 

Qijj ^ 0 
(QuQ2) sdc 


84% 


17 


Qi,ii > 0 

Qijj 0 

(QuQ 2 ) Non sdc 


45 % 


17 



Table 1: Convergence of the algorithm 



which permits the players to adapt their strategic choices at each jump time of a perturbing stochastic 
jump process. The computation of 5-adapted equilibria in stochastic games can be undertaken in 
a way very similar to stochastic programming for single player optimization (see e.g. [15] for a 
presentation of the main topics of stochastic programming). Let {^{t) : t > 0} continuous-time 
discrete-state Markov chain taking values in the finite set I. With each i € I is associated a particular 
demand law (we consider here a duopoly) D{xi^X2,i) = — Q z G /. The expected payoffs 

over a time horizon T are defined as 









( 15 ) 



6.1 Two information structures 



6.1.1 S-adapted information structure 

We consider the following information for constructing the players strategies 

• Let To = 0, Ti, = 0, 1, 2, ... be the successive jump times of the ^-process and = C(0), 

= f(ri), = ^(r,^) the visited states; 

• Call hj/ = {to, fo) ti, fi, . . . , Tj,, the random history up to jump u] 

• at any jump time each player j = 1,2, knowing (xj, x®) and hi, chooses an open-loop control 
Uj(') : [ti;,T) — R that will be used until the next jump occurs; 

6.1.2 Piecewise open-loop (POL) information structure 

The 5-adapted information structure should be compared with the piecewise open-loop information 
structure used in [13] where it is assumed that the players observe, at each jump time Ti, the discrete 
state and the continuous state x^ = (xi{ri,),X 2 (r„)). 
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6.2 Comparison of the two information structures 

6.2.1 Time consistency 

An equilibrium is time consistent, if, at any time 0, given that the equilibrium has been played 
up to that time, given the state (^(0),x*(^)) reached, the same pair of strategies would remain an 
equilibrium if one restarts the game at that time 6. 

6.2.2 Subgame perfectness 

An equilibrium is subgame perfect at jump times, if, at any jump time r„, whatever has been 
played up to that time, given the state reached, the same pair of strategies would remain an 

equilibrium if one restarts the game at that time 

6.2.3 Comparison 

• The 5-adapted information structure corresponds to a coarser information structure than POL 
with perfect recall. 

• An equilibrium based on Markov strategy in the POL information structure is time consistent 
and subgame perfect at the jump times. 

• An equilibrium based on a Markov strategies in the 5-adapted information structure would not 
always be time consistent. However an equilibrium based on the whole information available 
in the 5-adapted case will be time consistent, although not subgame perfect at jump times. 

6.3 Computation 

The 5-adapted equilibria can be approximated through an implementation of a Vl-algorithm on a 
discretized version of the stochastic duopoly model. The SDC property permits us to guarantee 
existence and uniqueness of the equilibrium for each of the approximating discrete time games. 
Results concerning the convergence of the discrete time approximations toward the continuous time 
5-adapted equilibrium will be reported shortly. 

7. Conclusion 

In this paper we have discussed some computational approaches for a class of dynamic Cournot 
oligopoly models, in a deterministic and stochastic context and with a finite or infinite time horizon. 
We have explored three different information structures for which the situation can be summarized 
as follows 

Open-loop: The existence, uniqueness and asymptotic stability of equilibrium trajectories can be 
guaranteed under an assumption of strong diagonal concavity of the payoff functions, linearity 
and decoupling of the state equations. The numerical computation exploits these properties. 

Markov-Feedback: This information structure applies to deterministic or fully stochastic mod- 
els (including disturbances in the form of jump and diffusion processes). The existence and 
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uniqueness of solutions is difficult to guarantee, even with strict diagonal concavity and a linear- 
quadratic (LQ) structure. We rely on numerical experiments to observe that, most of the time, 
under SDC, convergence to a unique equilibrium occurs in low dimensional LQ games. 

5-adapted: This information structure applies to a stochastic game, when the random disturbances 
are represented as an uncontrolled Markov chain. An approximate equilibrium can be com- 
puted, using stochastic programming techniques. Here again the strict diagonal concavity is 
helpful in guaranteeing the existence and uniqueness of equilibria in the approximating games. 
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Application of Topological Degree Theory to Semi-Definite 
Complementarity Problem 



A. Vladimir A. Bulavsky, CEMI RAS 

B. George Isac, Royal Military College 

C. Vyacheslav V. Kalashnikov, CEMI RAS 

1. Problem Specification, and Topological Degree Theory 

We examine the semi-definite complementarity problem (SDCP) in the standard form. Let be 
the Euclidean space of real n x n-matrices with the scalar product 

{A, B) = tx{A^B) VA, B e (1) 

Moreover, let 5”^” be the linear subspace of symmetrical real n x n-matrices, and 5” C 5”^” be 
the cone of positive semi-definite ones; from now on r4 > 0 means A e Interior points of this 
cone are positive definite matrices A, i.e. ^4 > 0 of full rank n. We denote that by A > 0. Lastly, 
introduce the norm at the space in the standard manner: \\A\\ = {A,Ay/‘^. 

Now consider a continuous mapping F : 5'”^” and define a solution X to SDCP as follows: 

X > 0, F{X) > 0, and (X, F{X)) = 0. (2) 

It is readily verified that for A,B£ the equality ti{AB) = 0 is tantamount to AB = 0. Therefore, 
we can reformulate SDCP as follows: find a matrix X > 0 such that 

F{X) > 0, and X • F{X) = 0. (3) 

It is well-known that every symmetrical matrix A 6 admits a complete orthogonal family of 
eigenvectors ai, 02, . • . , Moreover, its non-zero (necessarily real) eigenvalues Ai, A2, . . . , A*; (A: < n), 
together with the respective eigenvectors, form the spectral representation 

A = 5^Ajaiaf; (4) 

t=l 

here k = rank^ (cf. [1]). Making use of such spectral representations ^OI X = A and F{X) = B, we 
reduce the equality X • F(X) = 0 to the following system of equations: 

ajbj = 0, i=l,2,...,k; j = ( 5 ) 

where £ =rankF, and k-\-i < n. In particular, if rankX = rankA = n, we simply have F{X) = B = 0. 
In this paper, we introduce notions of exceptional and regular exceptional families of matrices and 
formulate the general (non-strict) alternative: for each continuous mapping F there exists either a 
solution to problem (2) or a (regular) exceptional family of matrices. Based upon this alternative, 
we prove both the well-known and new sufficient conditions of existence of solutions to the semi- 
definite complementarity problems. Like in [6], we do not demand that F be a monotone or coercive 
mapping, but in contrast to [3], [6], we make use of the topological degree theory. 
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Recall briefly the notion and some properties of the topological degree of a continuous mapping. 

Given a continuous mapping F : D C and a vector y e one often need knowing a 

number of solutions to the equation F{x) = y belonging to a particular setC C R^. A straightforward 
difficulty arises: the number of solutions in general doesn’t depend continuously upon either F or 
y. However, one can avoid this difficulty if one counts first a number of solutions x G C for which 
the mapping F keeps its ’’orientation” at some vicinity of the point x, and then subtracts from it a 
number of those solutions in C at which the ’’orientation” switches to the opposite. Thus obtained 
natural number is called a (topological) degree of the mapping F at the point y with respect to the 
set C and is often denoted by the symbol deg(F, C, y). 

The above definition of degree is by no means rigorous, since we have neither defined what the 
keeping or changing of ’’orientation” is nor described accurately the class of sets C to be considered. 
Nevertheless, even this non-rigorous definition shows that if deg(F, C, ?/) ^ 0, then at least one 
solution to the system F{x) = y exists and belongs to the set C. 

The notion of degree was originally introduced in the combinatorical topology, but its purely ana- 
lytical definition also exists (cf. [5]). 

Now we cite some key properties of the topological degree that will be used below. 

Poincare-Bohl Theorem (cf. [5]). Let C C R^ be an open bounded subset and F,G : d C R^ 
be two continuous mappings. If y E RP" is an arbitrary vector satisfying the condition 

y ^ H{z, t), zedC, te [0, 1], (6) 

where : cl C x [0, 1] — > R^ is a homotopy 

H{z,t)^tG{z) + {l-t)F{z), 



then deg(G, C, y) = deg(F, C, y). 

Kronecker Theorem (cf. [5]). Let C C BP be an open bounded subset and F : d C C BP BP be 
a continuous mapping. Ify^ F(dC) and deg(F, C, y) ^ 0, then equation F{x) = y has a solution in 
C. 

Since 5”^” is a finite-dimensional Euclidean space (with dim5”^” = n(n 4- l)/2), all the above 
theorems are well applicable to the considered problem. 

The paper is organized as follows. In Section 2, definitions of the exceptional and normed exceptional 
families of matrices are given and theorems on the alternative for the semi-definite complementarity 
problem are proven. Sufficient conditions that guarantee absence of exceptional families and thus 
provide for existence of solutions to the SDCP are presented in Section 3. 

2. Theorems on Alternative 

Let F : 5”^” -4 5”^” be a continuous mapping and 5” C 5”^” be the (convex closed) cone of 
positive semi-definite matrices. Denote by (S'”)* the dual cone (in the subspace 5”^”) for 5”, i.e. 

(5”)* = {Ye 5”^” : (y, X)>0 VX G 5” }. 
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It is well-known that the cone S'" is self-dual, i.e. (S")* = S". Therefore, the general complemen- 
tarity problem: to find a matrix X G S" such that 

F{X)e{Sl)\ and (X,F(X)) = 0, (7) 

is equivalent to the SDCP expressed by (2). 

Cone S" being a convex closed subset, every matrix Z 6 S"^" has the uniquely determined projection 
onto S": 

= FisnX = argmin{|lZ - X\\ : X e S"}. (8) 

Introduce the notation Z~ — — Z and remark that Z^ and Z~ depend continuously upon Z. 

Lemma 1. For every symmetrical matrix Z € S"^" the following relationships hold: Z~ € 5" and 
(Z~fZ+ = 0. 

Proof. Making use of the first-order optimality condition for problem (8), we write down the in- 
equality 

{Z - Z+-W)>0 e S". (9) 

Substituting W = 0 and W = 2Z^ into it, obtain {Z~ ^Z^) = 0. Now open the brackets in (9) and 
use the latter equality to deduce the inequality 

(Z-,VP>>0 

This means that Z~ € (S”)* = S" which implies Z~Z^ = {Z~)^Z^ = 0 and thus completes the 
proof. ■ 

Assertion of Lemma 1 means that Z~ is the normal element to the supporting hyperplane of the cone 
5" at the point Z~^ . 

Making use of spectral representation (4), it is easy to characterize and construct Z'^ and Z~ explicitly 
for a given symmetrical matrix Z. In particular, the following result obtains. 

Lemma 2. Let Z G 5"^" have a spectral representation 
k i 

Z = Ai > 0, /ij > 0, k-\-l< n. 

i=l 3—1 

Then ^ ^ 

z+ = Prs» z = Y. . Z- = Y 

i=l j=l 

Proof. Demonstrate that Z~^ satisfies the first order optimality condition (9), i.e. 

{Z - Z+, Z+ - IT) > 0 VVT G 5”. 

z-z+=~x:mM- 

i=i 



According to our assumptions. 
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For an arbitrary PF € 5”, make use of its spectral representation: 

m 

W = ^ i/iWiwf, i'i> 0, z = 1, . . . , m < n; 

t=i 

here Wi, i = 1, . . . , m are the orthonormal subsystem of the eigenvectors of matrix W corresponding 
to the positive eigenvalues. 

Now use the definition of the scalar product and write 

( e \ / ^ ^ 

- z) I ( z - z 

jz=l / \t=l i=l 

( i m \ i m 

Z Z Z (bj{bjwi)wf) = 

i=i i=l / j=l i=l 

i m 

= > 0 . 
j=l 1=1 

which establishes being a solution to projection problem (8). As the solution is unique, Lemma 
2 is proven. ■ 

Definition 1. A set of matrices {Zr}^>o ^ called an exceptional family (EF) with respect to 
the cone for a mapping F, if \\Zr\\ +oo as r — > +cx), and for each r > 0 there exists a scalar 
fir > 0 such that the matrix Mr = F{Zr) + jJirZr has the following properties: 

Mr > 0, and MrZr = 0. 

In the other words, for each r > 0, matrix Mr is the normal element to the supporting hyperplane 
of the cone 5” at the point Zr. 

Theorem 1. For each continuous mapping F : 5” -> there exists either a solution to problem 

(2) or an exceptional family of matrices. 

Proof. Consider equation 

f(Z) = F{Z+) -Z- = 0, (10) 

where Z'*' is projection (8), and = Z+ — Z. It is easy to see that problems (2) and (10) are 
tantamount in the following sense. If X solves (2), then Z = X — F{X) satisfies equation (10). 
Conversely, if Z is a solution to (10), then A' = Z+ solves problem (2). 

In order to investigate problem (10) in detail, consider a family of spheres V). and open balls Wr- 

Vr = {Z&S^^": ||Z||=r}, 

Wr = {Z & : \\Z\\<r}. 

Now examine the standard homotopy H{Z, t) of the identity mapping g{Z) = Z and the continuous 
mapping /(Z): 

H{Z,t) = tZ + il-t)f(Z), 0<t<l. 
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Apply the Poincare-Bohl Theorem for y = 0, C = Wr and write down ff(Z, t) for arbitrary Z e 
dWr = Vr and t € [0, 1]: 

H{Z, t) = tZ-^ + (1 - t)F(Z+) - Z". (11) 

Two outcomes are possible; 

A ) There is a r > 0 such that 



H(z,t)^o, vzeK, vte[o,i]. 

Then the Poincare-Bohl Theorem allows one to conclude that deg(/, Wr,0) = deg{g,Wr,0). The 
identity mapping degree is equal to unity: deg(p, lTr,0) = 1. Therefore, deg{f,Wr,0) = 1. Now the 
Kronecker Theorem implies existence of solution to (10) within the ball cl Wr, hence problem (2) is 
solvable, too. 

B) For every r > 0 there exist a matrix Zr €Vr and a scalar tr G [0, 1) such that 

H(Zr,tr)=0. ( 12 ) 

If tr = 0, then Zr solves (10), which again implies solvability of problem (2). Otherwise, if U > 0, 
then it follows from (11) and (12) that 






(13) 



Dividing both parts of the latter equation by (1 - tr), we obtain 



F(Z^) + 





(14) 



Put fjLr = tr/{l - tr) and Verify that {Z+} is an EF for 5” and F. Indeed, from (14) and Lemma 1, 
it follows that the matrix Mr = F{Z;I') + firZ^ is a normal element of the supporting hyperplane to 
the cone 5” at the point Z+. 

In order to demonstrate the family {Z;^} to be exceptional, one needs to verify that ||Z+|| H-oo as 
j- 4 - 00 . On the contrary, suppose that the family {Z;^} has a finite accumulation point. On the 
one hand, the equality ||Z"|| = yjr"^ - ||Z+||2 implies that the right-hand side of (13) comprises an 
unbounded sequence of matrices. On the other hand, the respective matrices in the left-hand side of 
(13) compose a bounded family due to the continuity of the mapping F. Thus obtained contradiction 
completes the proof. ■ 



Definition 2. Exceptional family of matrices {Zr} is referred to as a regular one (REF), if 



\\Zr\\ = r Vr>0. 



By modifying the proof of Theorem 1, the following result obtains easily. 

Theorem 2. For each continuous mapping F ; 5” -> 5"^” there exists either a solution to problem 
(2), or a regular exceptional family of matrices (REF). 

3. Sufficient Conditions of Solution Existence 
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By making use of Theorems 1 and 2, one can obtain new proofs of previously known results, as well as 
deduce some new sufficient conditions that guarantee existence of solutions to the complementarity 
problem. Begin with the following result. 

Theorem 3. Let F : S'" -> 5"^" be a continuous mapping and there exist a function : [0, +oo) -> 
[0, H-oo) with the properties 



^ li^ (p(t) = H-oo, p(t) > 0 for t > 0, and 

{X - Y, F(X) - F(Y)) > ||X - Yy (||X - y II) , VX, y 6 5:. (15) 

Then problem (2) with the mapping F is solvable. 

Proof. Suppose that there exists an EF {Zr}r>o C 5" for the mapping F. According to the 
definition of EF, matrices Mr = F{Zr) H- /Xr^r and Mi = F{Zi) H- fiiZi are the normals of the 
supporing hyperplanes to the cone 5" at the points Zr and Zi, respectively. Making use of this fact, 
write down the following sequence of relationships 



(Zr — Zi, F(Zr) — F(Zi)) — {Zr ~ Z\^ Mr — M\ — fJLrZr + PlZi) = 



= {Zr — Zi, Mr) — {Zr ~ Zi, Mi) — {Zr — Zi, pLr^r “ Ml^l) < 

< —{Zr — Zi, p>r{^r ~ Zi) — (/Xi — p.r)Z-\) = 

= —pr^Zr — ZilP H- {pi — p.r){Zr “ ■^’i, Zi). (16) 

By conditions (15), (16), and Caiichy-Buniakowski inequality, finally obtain 

ifiWZr - ZiW) < -pirWZr ~ ^l|| + |/Xi - /X,|||Zi||, 

which contradicts the condition p{t) -> H-oo as t H-oo; here \\Zr - Zi\\ serves as t when r -> H-oo. 
Thus the cone 5^ and mapping F forbid EF to exist. Therefore, problem (2) is solvable, and the 
Theorem is proven. ■ 

Remark 1. If (p{t) = t, the mapping F satisfying (15) is called strongly monotone, in contrast to 
simply monotone one for which (15) holds with (p{t) = 0. 

Theorem 4. Consider a continuous mapping F : 5" 5"^" and a bounded subset C C 5" such 

that for every X G 5" \ C there exists a matrix Y G C with 

{X-Y,F{X))>0. (17) 



Then problem (2) has a solution. 

Proof. Suppose that an EF {Zr} C 5" exists. From some r > 0 onward, the matrices Zr belong 
to 5" \ C. According to the Theorem’s assumptions, there exists a matrix Yr e C such that the 
following relationships hold: 



Q<{Zr- Yr, F{Zr)) = (Z, - F,, M, - /X,Z,), 
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with matrices Mr associated with Zr in the definition of an exceptional family. Re-arrange the terms 
of the latter inequality and obtain 

{Zr - Yr, Mr) - + MYr, ^r) < -Mrl|^r|| (||^r|| ~ 11^11) < 0, 



which leads to the contradiction. Therefore, no EF exists, and problem (2) is solvable as it follows 
from Theorem 1. ■ 



Example 1. Construct a mapping F : 5^ -> as follows. If a matrix X E has a spectral 
representation 

X = /xiOiaf + ^2^20^5 



then set 

F{X) = AiUiaf + A2020^, 



where 



Ai = /ii + 3sin/i2 - 1, A2 = //2 4- 3sin/xi - 1. 



The mapping F is not monotone, but it is continuous and satisfies (17) over the cone 5^ (the unity 
ball Wi can be picked up as the set C). According to Theorem 4, the SDCP for this mapping has a 
solution. In particular, the solution set includes the matrices 



A = 





andD.(« 



where 0 < f < 1 solves the transcendental equation f + 3sinf = 1. 



Corollary 1. Let F : 5” 5”^” be a continuous mapping and C C 5” 6e a bounded subset such 

that (y, F(y)) > 0 Vy G 5" \ C. Then problem (2) is solvable. If the strict inequality {Y, F{Y)) > 0 
takes place for each Y G \ (7, then all the solutions of problem (2) belong to the subset C. 

Corollary 2. Let F : be a continuous mapping and C C be a bounded subset such 

that F{Y) > 0 vy G S^\C . Then problem (2) is solvable. Moreover, z/ 0 G C and the stronger 
condition F{Y) > 0 is valid for every Y e S'l\C, then all the solutions of problem (2) belong to the 
subset C. 



Consider further a compact subset C C 5” which is star-like with respect to the origin, i.e. it contains 
the whole segment [0,7^] if a matrix X belongs to C. Then the function r]{X) is well-defined that 
maps X G 5!J, A ^ 0 into the most distant (from the origin) point of the set C in the ray running 
through 0 and the point X. It is clear that the subset F = {y = r/(X) : X G 5” , X 7^ 0} constitutes 
the relative border of the subset C with respect to the cone 5”. 



Theorem 5. Consider a continuous mapping F : 5” — > 5”^” and a non-empty subset (7 C 5” that 
is compact and star-like with respect to the origin. If the function r}{X) is continuous, r]{X) ^ 0, 
and (y, F(y)) > 0 for every Y eT, then problem (2) has a solution in the subset C. 



Proof. Define a mapping G : 5” -> 5”^” as follows: 






F{X), 

F(r,(X)) + ||X-,?(X)l|X, 



ifX ec-, 
ifxesixc. 



( 18 ) 
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The mapping G is clearly continuous over 5” and satisfies the following inequality 

(X,G(X))>0 XeSl\C, 



The latter provides for existence of a solution to problem (2) with the mapping G over the subset 
C. However, due to (18), the mappings G and F coincide over C. Therefore, problem (2) with the 
mapping F is solvable over subset C, too. This completes the proof. ■ 



Example 2. Construct a mapping F \ S\ as follows. If a matrix X e S‘1 has a spectral 

representation 

X = fliGitti + fl2d2(^2^ 



then set 



F{X) = + \2(l2(^2 •> 



where 



Ai = 3/Xi - /ii/Z2 - 1, A 2 = 3/i2 - Ail/i2 + 1- 



Take the set { X e S‘1 : /^i + //2 £ 1} for the subset C. It is easy to see that the latter satisfies 
conditions of Theorem 5. Indeed, for the relative border of the subset C we obtain 



(y; F{Y)) = tr [(faiflf + (1 - ?)«24’) + 2^ - l) Oiaf+ 

((1 - e)" + 2(1 - 0 + l) 0202 )] = .e" + (1 - 0® + 2 [e + (1 - 0"] +1-2?. 

Transforming the linear and quadratic terms, we come to the following inequality: 

2 [?^ + (1 - ?)^] + 1 - 2? = 4?^ - 6? + 3 > 0 VO < ? < 1. 



Finally we get 

(y F{Y)) > ?* + (1 - ?)® > 0 VO < ? < 1. 

Therefore, problem (2) has a solution in the simplex C, although the mapping F is clearly non- 
monotone. For instance, the matrix 



- (? :) 



solves the problem. 



Now we extend Theorem 5, giving up the continuity of the function rj. In order to do that, first 
reduce the domain of the function rj to the subset 0 = Vi fl 5” which is an intersection of the unity 
sphere around the origin and the cone 5”. Furthermore, we again assume that t){X) ^ 0 \/X G 0, 
but do not demand the function 7] to be continuous over 0. It clearly allows the subset C to have 
the border which contains the whole segments of rays emitting from the origin. 

Theorem 6. Consider a continuous mapping F : 5” — >• 5”^” and a non-empty compact subset 
C C 5” which is star-like with respect to the origin and 0 ^ F. If (F, F{Y) > 0 for each matrix Y of 
the relative border F, then problem (2) has a solution in the subset C. 

In order to deduce new sufficient conditions for existence of solutions to the general complementarity 
problem, we examine some properties of the exceptional family of elements. 
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Lemma 3. For each element of an exceptional family Zr G 5” the following relationships are true: 
(i) {U,F{Zr)) < 0 for every non-zero matrice U that belongs to the coneys facet of the minimal 
dimension containing the element Z^ and satisfies the inequality (C/, Zr) > 0; 

(a) {D^F{Zr)) > 0 for every (nonzero) feasible direction D from the point Zr (i.e. not leading 
outside the cone S^) and such that {D, Zr) = 0. 

Proof. Recall that for each matrix Zr G of the exceptional family, there exists a scalar > 0 
such that the matrix Mr = F{Zr) KZr is an interior normal element of a supporting hyperplane 
to the cone 5” at the point Zr. Consider a direction U that lies in the cone’s minimal dimension 
facet containing Zr. The properties of a supporting hyperplane imply the equality {[/, Mr) = 0 that 
is tantamount to (U,F{Zr)) = —Xr(U,Zr). Moreover if {U,Zr) > 0, then assertion (i) follows from 
the latter equality. 

Again use the properties of the normal of a supporting hyperplane (cf. Lemma 1) and obtain 
inequality (JD, Mr) > 0 for each feasible direction D at the point Zr. In additon, if {D, Zr) = 0, the 
required inequality (ii) follows immediately from the structure of matrix Mr. The proof is complete. 

■ 

Theorem 7. Let F : 5” — > 5^^^” be a continuous mapping and C C 5” 6e a non-empty bounded 
subset such that for every X e S'l\C (at least) one of the following assumptions is valid: 

(i) (U,F{X)) > 0 for a non-zero matrix U that belongs to the cone’s minimal dimension facet 
containing the matrix X, and satisfies {U, X) > 0; 

(ii) (D, F{X)) < 0 for a non-zero feasible direction D at the point X with {D,X) = 0. 

Then problem (2) is solvable. Moreover if assumption (i) holds for U = X, all the solutions of (2) 
belong to C. 

Proof. Suppose an exceptional family of matrices {Zr} C to exist. From some f > 0 onward, we 
have Zr £ S'^\ C. According to the Theorem assumptions, at least one of the conditions (z) or {ii) is 
valid for the elements Z^ r > f which contradicts the assertion of Lemma 2. That denies existence 
of exceptional families and therefore establishes solvability of problem (2) (see Theorem 1). At last, 
if either the stronger version of assumption (i), or assumption (ii) take place, we deduce for every 
A G 5” \C either (A, F{X)) > 0, or F{X) ^ S'”, respectively. Consequently, all the solutions of (2) 
belong to C, which completes the proof. ■ 

By making use of the techniques from proofs of Theorems 5-6, we obtain easily the following corollary. 

Corollary 3. Let F : be a continuous mapping, and a non-empty subset C C S” 6e 

compact and star-like with respect to the origin with 0 ^ F. If for every A G S” \ {0} (at least) one 
of the following assumptions concerning Y = r]{X) is valid: 

(i) {U, F{Y)) > 0 for a non-zero matrix U that lies in the cone’s minimal dimension facet containing 
the matrix Y, and satisfies {U, Y) > 0; 

(ii) {D, F{Y)) < 0 for a non-zero feasible direction D at the point Y such that {D, Y) = 0, 

then problem (2) is solvable. Moreover if (i) holds for U = Y, then there exists a solution of (2) 
belonging to C. 
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On the First-Order Estimation of Multipliers from 
Kuhn-Tucker Systems in Multiplier Methods Using 
Variable Reduction 



E. Mijangos, Euskal Herriko Unibertsitatea (Spain) 

N. Nabona, Universitat Politecnica de Catalunya (Spain) 

Summary: The minimization of a nonlinear function with linear and nonlinear constraints and 

simple bounds can be performed by minimizing an Augmented Lagrangian function that includes only 
the nonlinear constraints subject to the linear constraints and simple bounds. It is then necessary 
to estimate the multipliers of the nonlinear constraints, and variable reduction techniques can be 
used to carry out the successive minimizations. This procedure is particularly interesting in case of 
that the linear constraints are flow conservation equations, as there exist eflScient techniques to solve 
nonlinear network problems. In this work the possibility of estimating those multipliers through the 
Kuhn-Tucker optimality conditions is analyzed and compared with the Hestenes-Powell multiplier 
estimation. A method is put forward to identify when the first procedure can be safely used indicating 
an efficient way to compute these estimations. Computational tests are included. 



1. Introduction and motivation 

Consider the constrained problem (CP) 



minimize f{x) (1) 

subject to: Ax = 6 (2) 

c{x) = 0 (3) 

I < X <u, (4) 

where: / : IR” -> IR is nonlinear and / G on the feasible set defined by constraints (2-4); A is 
an m X n matrix and b an m- vector; and c : IR'* — ^ IR'’, is such that c = [ci, • • • ,Cr]^ Ci(x) being 
linear or nonlinear and c, G on the feasible set defined by constraints (2) and (4) Vi = 1, • • • , r. In 
the particular case of A being a node-arc incidence matrix, the constraints (2) are flow conservation 
equations and the constraints (3) are called side constraints. 

To solve this problem one can use Partial Augmented Lagrangian techniques [1], which are suggested 
in [3, 4, 8], where only the general constraints (3) are included in the Lagrangian. 

In the application of these techniques there are two fundamental steps. The first one is the solution 
of the subproblem (LS) 



minhnize Lp(x, /x) 


(5) 


subject to Ax = b 


(6) 


1 < X <u^ 


(7) 



where p > 0 and /x are fixed, and Lp(x, /i) = f{x) + /x^c(x) + f c(x)‘c(x). 
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Should the solution x obtained be infeasible with respect to (3), the second step, which is the updating 
of the estimation // of the Lagrange multipliers of constraints (3), is carried out, also updating, if 
necessary, the penalty parameter p, then going back to the first step. 

Should X be feasible (or the violation of constraints (3) be sufficiently small) the procedure ends. 
The active set technique of Murtagh and Saunders [13] solves efficiently subproblem LS, and in the 
case of constraints (6) being flow conservation equations, the previous technique can be specialized 
to take advantage of the network structure of matrix A (see [8, 11, 15]), achieving a high efficiency, 
this being the main reason to take into account the Partial Augmented Lagrangian techniques in the 
solution of CP (1-4). 

An important issue in this kind of methods is that the estimation of multipliers /i must be as accurate 
as possible, otherwise the convergence of the algorithm can be severely handicapped, as shown in 
[1, 2]. In practice there are two first order procedures to estimate fx. On the one hand, the updating 
put forward by Hestenes [7] and Powell [14], referred to as Uhp in this work, 

fL = fi-\- pc{x), (8) 

and suggested for this problem type in [1, 2, 3, 4, 8, 11]; and, on the other hand, the method based 
on computing the estimate pi through the classical solution by least squares of the (overdetermined) 
system of Kuhn-Tucker conditions (KT-system), referred to here as Ukt^ 

V f{x) + Vc(x)/z + A*7t -h a = 0. (9) 

However, whereas the first procedure can be always used for any x, without hampering the con- 
vergence [1], this is not the case with the second procedure, as the KT-system is only known to 
be compatible at the optimizer x*, not being necessarily so at x, thus possibly giving rise to bad 
estimations shown up by large residuals for the KT-system. Through this work when referring to 
a compatible system we mean that the solution to the system exists and is unique, with zero residual. 
A study of the viability of using this multiplier estimation technique together with the solution of 
problem LS (5-7) by the Murtagh’s and Saunders’ procedure has been carried out in this work. In 
addition, a procedure is put forward to find out when the KT-system can be safely used indicating 
an efficient way to compute these multiplier estimations. Computational tests are included where 
the efficiency of both updates is compared. 

2. Analysis of compatibility 

The compatibility of the multiplier estimate obtained by solving the KT-system (9) with the variable 
reduction techniques and its relationship with the estimate of Hestenes and Powell (8) are analyzed 
along this section. 

Through this work we assume that x* is a regular point — i.e., the Jacobian of the active constraints 
at X* has full rank — and write Vc(a:) = [Vci(a;), • • • , Vcr(:z;)]. (The gradient is considered to be a 
column vector.) 

Each time subproblem LS (5-7) is solved exactly by Murtagh and Saunders’s active set method (see 
§1), we obtain an optimizer x and an associated partition of matrix A = [Ba | Sa | Na], and, thus, 
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the variable reduction matrix Za shown below: 

--b-/Sa 

Za= H , which satisfies AZa = 0. (10) 

0 

Since x is an optimizer of problem LS the necessary first-order optimality conditions must hold; i.e. 



there exist unique vectors ^ and A such that: 

Va:Lp(x,//) + ^'^ + A = 0 (11) 

Ax = h (12) 

Xi = lu i = t + (13) 

Xj = Uj, z = ?+l,---,n (14) 



where t is the number of basic and superbasic variables, and vector A satisfies the complementarity 
conditions in relation to the simple bounds (7). Obviously, expression (11) can be written as 

A^tt + Vc(x)[/i + pc{x)] + A = -V/(x), (15) 



which in matrix form is 

Vb^c(j) 01 r 7 f 1 rvB^/(x)‘ 

Sa 0 ti + pc(x) =- VsJ(x) , (16) 

.K V^^c(x) ij [ A J [v;,^/(x). 

where Vb^c(x) stands for the rows of Vc(x) associated with the rows of B\. Vsac{x) and Viv^c(x), 
and also the partition of V/(x), are defined similarly. Let us assume that matrix 



Ba Na }m 

= VbAxY I V/v^c(^ }r 

has full row rank. It is now possible to get from A{x) a full rank basic matrix B such that it contains 
matrix Ba- Once the basic matrix B has been defined, matrix S can be set up such that [B 5] 
contains Ba and Sa as submatrices, thus: 




with Sa = [S'a \ Sa]. The rest of columns from A{x) makes up submatrix N. See [9] for an efficient 
procedure to build up B from data available at x when subproblem LS (5-7) has been solved. 

Prom this partition [B S N] of A{x) we have the variable reduction matrix 



--B-^S' 

Z[x) = 11 , which satisfies A{x)Z{x) = 0. 

0 
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Premultiplying by Z{x) both sides of (15) we get the equivalent expression 

ZixfA^n -h Z{xyVc(x)[fi + pc{x)] 4- Z{xYX = -Z(xYVf{x). (18) 

According to the definition of Z{x) the following must hold Z(x)^Vc(x) = 0 and Z{xYA^ = 0. 
Furthermore, 

■A®- 

Z{xyX=[-{B-^Sy a O] =-(B-'5)‘A® + P, (19) 

lfJ 

where B denotes the set of indices associated with the columns of we define the index sets S and 
Af analogously. 

As can be seen, vector a Z(5)^V/(x) 6 will be null only if the following condition holds 

P = (5"^5)'P, (20) 

which in general is not satisfied, as can be easily proved; see [11]. 

Let Gk = [G\k, • • ' (with m = m + r) be the column of B~^S associated with the superbasic 

variable Xk\ then (20) can be recast as 

m 

= k€S. 

i=i 

The basic equivalent path pk of superbasic variable Xk is defined as the set of basic variables xi that 
have a nonzero entry aik in the column of B~^S corresponding to variable Xk. 

Taking into account the expressions (18-19) we get that the A;-th component of a is defined as 

0‘k= OikXf - Af , (21) 

where N is the index set of the columns of the submatrix 

(22) 

selected to make up the basis matrix B of A{x) (see expression (17)) and Gik, as previously defined, 
entry (i, k) of matrix B~^S. Furthermore, since by construction the set of indices associated with 
the columns of 5 is a subset of the set of indices associated with the columns of 5^, A*; = 0 holds for 
all k corresponding to a column of 5, hence (21) becomes 

«* = E (23) 

•€/8knjv 

It must be pointed out that vector a turns out to be the nonzero part of the residual vector corre- 
sponding to system (9), when, once the partition of matrix A(x) is fixed, it is solved calculating first 
(tt, pi) through the solution of system 

7T 7T 

B* = — Vb/(x), and then computing = -N* - Vj^f(x). 

. 1^1 LmJ 
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Next, the results of a series of propositions presented in [10] are given. 

• System (9) is compatible if and only if Z(x)*V f{x) = 0 holds. 

• Let r D 

BS = (24) 

[^ba^y ^sA^yI ^ 

and Za the matrix defined by expression (10). Matrix BS has full row rank if and only if Vc{xYZa 
has also full row rank. 

Premultiplying equation (9) by matrix Z\ and moving Z^V/(5) to the right hand side we obtain 



Z\Vc{x)n=-Z\Vf(x). 



(25) 



• Let us assume that BS has full row rank. System (9) is compatible if and only if system (25) is 
compatible. 

• Under the same conditions of the previous propositions, system (25) is compatible if and only if 
Z{xyVf{x) = 0 holds. 

Owing to these properties, we have the following consequences: 

• If a ^ 0 (see expression (23)), system (9) is not compatible, and neither is system (25). Therefore, 
in this case the estimate of /i* by procedure Ukt is not reliable. 

• If when building up the basic matrix B of A{x) it is unnecessary to take columns of A{x) that 
contain columns of Na^ we have a = Z(x)*V f(x) = 0, and the associated systems (9) and (25) are 
compatible. Moreover, the /jl obtained by solving either (9) or (25) will be the same. Therefore, it is 
enough to solve (25) to calculate fi in procedure Ukt- 

• If problem CP (1-4) has not simple bounds, i.e. if it is only defined by (1-3), then we always 
have that a = 0 for all x = x(/z, p), because solving the subproblem LS (5-7) without simple bounds 
amounts to having A = 0. 

• It is clear from the above results that in order to build up matrix B it is preferable to use only 
columns of A{x) associated with Ba and Sa • Moreover, whenever necessary, in the construction of 
B the second preferable choice are columns of nonbasic variables in LS belonging to the I columns 
associated with Na such that A/ = 0, for / 6 AT, if any, see (23). As a consequence we define 






Za — [Za Z^\ — 



n 

0 

0 



-Ba^Na 

0 

I 

0 



and in this case system (25) is replaced by the system z\Vc{x)ii = -Z\Vf{x). 

Next, by means of a proposition we consider the general case given at the beginning of this section. 



Proposition 1 //a = 0 and LS (5-7) is solved with exact minimization , the Ukt procedure is valid 
for estimating p * . Furthermore, when B can be built up following the rule given in the last item, the 
results obtained coincide with those found by means of the Uhp procedure. Otherwise, even ifa = 0, 
the estimates of p* obtained through Ukt o>'n>d Uhp are not the same. 

The proof of this proposition can be found in [10], and an efficient procedure for computing a is put 
forward in [9]. 
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3. On the solution of CP (1-4) 

Based on the previous sections, a method to compute the multiplier estimate Jx is given below. 
Procedure Ukt- 

1. Search for a basis matrix in A{x). If A{x) is not a full rank matrix, go to 3; otherwise, 

2. Compute a. If Tr := {||a|| < r} (r being a sufficiently small tolerance), fx is updated by 
solving system Z\Vc{x)fx = —Z\Vf(x), obtaining Jx\ otherwise, 

3. IX is updated by computing p, = fx-\- pc{x). 

When it is unnecessary to take columns of nonbasic variables to get a basis matrix B of A{x), a = 0 
automatically holds, and Z a = 

On the other hand, an exact minimization of subproblem LS (5-7) means that the reduced gradient 
of the Augmented Lagrangian at the optimizer x is null. However, in computational practice an 
exact convergence is not usually achieved and the standard procedure is to require that the norm of 
the reduced gradient be lower than a predetermined tolerance r. Moreover, it does not seem to be 
advantageous to maintain always the same tolerance, given that the highest precision is only required 
at the end of the minimization process. For this reason, a tolerance Tk is associated with the k-th. 
major iteration of the Partial Augmented Lagrangian techniques considered in §1 (in contrast to the 
minor iteration corresponding to the solution of subproblem LS) such that lim = 0. This kind of 

k-^oo 

minimization is usually known as an asymptotically exact minimization of the corresponding problem 
CP (1-4), see [1]. Hence, when Ukt can be performed, the fx obtained does not coincide with that 
calculated by using Uhp- The object of test Tr in step 2 is to check whether the residual error 
is small enough so that one can safely use Ukt- In the A;-th major iteration Ukt estimation will be 
employed whenever 

l|a*ll < n, ( 26 ) 

holds. An efficient procedure to compute the vector can be found in [9]. 

Next, a series of computational tests are performed to compare the efficiency of above procedure of es- 
timating IX with respect to the use of only the standard first-order estimate Uhp when asymptotically 
exact minimization is used. 

4. Implementation 

Code PFNRN03 [12] has been developed to try, among others, the previously described proce- 
dures. This code and its user’s guide can be accessed for academic purposes via anonymous-ftp at 
ftp-eio. upc.es in directory pub/onl/codes/pfnrn. (Comments, suggestions, and the description 
of any trouble or abnormal situations found when using it sent to mepmifee01g.ehu.es will be 
appreciated). 

This code has also the following special features: 

• The matrix A of the problem CP (1-4) corresponds to the node-arc incidence matrix of a network. 

• The general constraints c{x) = 0 are replaced with the two-sided inequality contraints c < c{x) < c, 
where c and c are bounds to the vector function c{x). Hence, the problem CP is the particular case 
of c = c = 0. The procedure Ukt for problem CP is extended in [10] in order to be able to solve 
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problem 

name 


arcs 


nodes 


side 
const r. 


active 
s.c. at X* 


Jacobian 

entries 


L12e2 


1524 


360 


180 


8 


180 


L14nl 


1524 


360 


36 


27 


538 


L12nl 


1524 


360 


180 


24 


180 


D12e2 


1524 


360 


180 


5 


183 


D13e2 


1524 


360 


360 


10 


364 


D16nl 


1524 


360 


36 


23 


96 


D23e2 


5420 


1200 


120 


18 


135 


D32el 


4008 


501 


50 


5 


199 


D51el 


18000 


3000 


30 


8 


526 


P3201 


3204 


721 


36 


12 


3112 


P4203 


5187 


1548 


91 


10 


4443 


P4201 


5187 


1548 


91 


32 


4443 


P4403 


7581 


2262 


133 


10 


6501 


P4401 


7581 


2262 


133 


32 


6501 



Table 1: Test problems. 

problems with two-sided inequality general contraints. 

5. Computational results 

In order to compare the efficiency of the procedure considered in §3 (here denoted Ukt) with respect 
to the Uhp only update a series of computational tests are performed, which consist of solving 
nonlinear network flow problems with linear and nonlinear side constraints (described in [8]) with 
code PFNRN03 using both methods to estimate fi. All these tests have been performed on a Sun 
Sparc 10/41 work station. 

Two types of problems are considered; real and artificial. The real problems solved are of Short-Term 
Hydrothermal Coordination of Electricity Generation [8]. They correspond to problems whose name 
starts with “P”, and they have two-sided inequality nonlinear side constraints. Problems created 
from DIMACS random network generators [5] are used as artificial problems: generators Rmfgen, 
Gridgen and Grid-on-Torus have been employed, linear side constraints have been created with 
the Di2no random generator [6], and they give rise to the problems whose first letter is an “L”. 
The nonlinear side constraints for the DIMACS networks are generated through the Diml random 
generator described in [8], and they give rise to whose first letter is an “D”. Several types of objective 
function used to create problems with DIMACS networks. The last two letters in the problem names 
starting with L or D indicate the type of objective function, “nl” functions are of the Namur family 
used in [15]. Objective function EIOl (“el” and “e2” functions) were designed by Heredia in [6] with 
a view to obtaining problems with a moderate number of superbasic variables at the optimizer. 

The table 1 shows the characteristics of the test problems specifying the number of arcs (variables). 
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problem 

name 


RESIDUAL 


Ukt results 


Uhp results 


^hpI^kt 


(a = 0)/Mit 


a 


Mit 


mit 


t 


Mit 


mit 


t 


L12e2 


9/9 


0 


10 


522 


6.01 


10 


521 


4.87 


0.81 


L14nl 


1/18 


3.35 


19 


2842 


98.27 


19 


3369 


119.98 


1.22 


L12nl 


10/10 


0 


11 


4346 


136.70 


12 


4458 


154.58 


1.13 


D12e2 


22/22 


0 


23 


1589 


21.92 


24 


1284 


15.88 


0.72 


D13e2 


20/21 


8.65 


22 


2684 


45.10 


21 


2497 


37.80 


0.84 


D16nl 


6/6 


0 


7 


8325 


337.89 


6 


11190 


685.41 


2.03 


D23e2 


29/29 


0 


30 


2889 


159.69 


29 


3321 


176.63 


1.11 


D32el 


29/32 (t) 


0 


33 


4387 


121.23 


20 


7963 


228.86 


1.89 


D51el 


29/31 (t) 


0 


32 


1384 


199.41 


13 


1170 


96.05 


0.48 


P3201 


9/11 


0.11 


12 


13950 


1824.48 


35 


12542 


1621.24 


0.89 


P4203 


6/6 


0 


7 


2878 


282.30 


10 


3056 


316.28 


1.12 


P4201 


5/7 


2.78 


8 


20398 


2195.39 


8 


22845 


2375.06 


1.08 


P4403 


4/7 


0.82 


8 


3882 


632.84 


8 


3681 


567.14 


0.90 


P4401 


6/7 


0.12 


8 


23370 


3455.86 


8 


26361 


3951.07 


1.14 



Table 2: Comparison of results. 



nodes (rows in A), side constraints (s.c.), active side constraints at the solution x*, and non-null 
entries in the Jacobian Vc(x). 

The table 2 shows the compared behavior of the Ukt and Uhp updates. Under the heading “RESID- 
UAL” the following data are given: in column “(a = 0)/Mit” the number of major iterations (over 
the possible total) in which a was zero or almost zero (in the first major iteration // = 0); and 
in column “a” the mean value of ||a|| in the major iterations where (26) did not hold. Under the 
heading “RESULTS” columns “Mit” , “mit” and “t” , respectively show, the number of major itera- 
tions, the number of minor iterations and the CPU seconds spent in the execution. The last column, 
''tHp/tKr\ contains the ratio of the total CPU time spent when using these procedures to solve the 
same problem. As can be observed, the efficiency ratio, is around 1. In relation to the 

problems with nonlinear side constraints ( “D” and “P” models) we can see that the most remarkable 
performances achieved with Ukt were those corresponding to problems “D16nl” and “D32el” , with 
an efficiency ratio in relation to Uhp of 2.03 and 1.89, respectively. However, in most of the tests 
performed the efficiency for Ukt and Uhp was similar. The symbols (f) and (J) in this table mean 
what there were a number of major iterations — initial ones — where it was not possible to get 
a basis matrix from the Jacobian of active constraints, specifically for (f) in the first three major 
iterations and for (J) in the first two. 

Trials were also carried out in which test Tr in step 2 of procedure Ukt (see §3) was omitted. In 
that case it was observed that there were executions of PFNRN03 in which did not converge to 
fi* and: 
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• either p tended to oo because of the violation of c{x) did not tend to zero, so far the algorithm 
stopped in the line search due the bad conditioning, 

• or the algorithm stopped in the line search due to unduly short steps in the minimization of the 
augmented Lagrangian, which appeared to be nonconvex. 

For example, in test problem “L14nl” the algorithm did stop at iteration 1027 with a value of the 
objective function 163.96 (184.979 at the feasible optimal solution) and an infinite norm of constraint 
violation equal to 6.72, which had previously augmented from value 4.71 to 7.91, after updating p, 
by Ukt and in spite of the fact that the penalty parameter had been augmented in the same major 
iteration. 

6. Conclusions 

In practice, nonlinear problems with general constraints can be solved by using Partial Augmented 
Lagrangian techniques making use of Kuhn-Tucker systems to update the multiplier estimates, and 
exploiting the active set techniques in the solution of the constrained subproblem. However, some 
safeguards as those described in step 2 of the procedure Ukt put forward are necessary. 

If the problem has simple bounds, we must take into account the value of the residual of the Kuhn- 
Tucker system. Once the basis matrix of coefficients in this system has been chosen, by taking 
advantage of the matrix of active constraints given by the subproblem, we can compute the norm of 
the corresponding residual vector. Through a test over this norm the algorithm decides whether the 
update of the multiplier estimates is performed by solving the system or using the original update 
of Hestenes and Powell. Therefore, both updates Uhp and Ukt allow us to exploit the efficiency 
of the Murtagh and Saunders’s techniques specialized in nonlinear network flow problems over the 
subproblems, provided that the residual norm be small enough. 

To compute p it is sufficient to solve a reduced system whose matrix has a dimension not greater than 
(s + Va) X To, where s and Va are the number of superbasic variables and of active side constraints, 
respectively. 

The conclusion obtained through the computational tests is that the estimate of Hestenes and Powell 
and that derived from solving the Kuhn-Tucker system have a similar efficiency, the second one having 
the disadvantage of the expensive cost of the matrix operations. 

With the network problems tested there is no significant computational advantage in using, whenever 
possible, the Kuhn-Tucker system to estimate the multipliers of the side constraints with respect to 
the Hestenes and Powell only update. 
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Properties of Some Convex Marginal Functions Without 
Constant Rank Regularity 



U. Wiirker, Technical University of Chemnitz 

Summary: We consider a convex optimization problem of two groups of variables x and y, where the 
objective function and the inequality constraints are jointly convex and continuously differentiable. 
Supposing a regularity condition and using a decomposition approach with the variables x at the 
lower level we get that the marginal function ip{y) with respect to y can be represented as the 
maximum of a finite number of convex functions. 

If additionally at a point the strong second order sufficient optimality condition (SOC) and 

the constant rank constraint qualification (CR) hold, then it is known that the solution function x{y) 
is a PC^-function and hence ^{y) has a local representation as maximum of K convex differentiable 
functions fi{y). In general, the finite number K and the (locally defined) functions depend on the 
point 2 /°, and no such representation of ip is known in the neighborhood of points y^ where (CR) is 
violated. 

But in the particular case under consideration better properties can be obtained without supposing 
(CR). Especially in the case p = 1 (y G P^), the functions fi{y) can be rearranged and partially 
redefined in such a way, that an analogous global representation holds. Then, all functions fi are 
defined and differentiable on int dom((p). A similar result can be obtained in the case p > 1 for linear 
and quadratic problems. 

1. Introduction 



We consider the jointly convex optimization problem 

q* = inf {/(a:)+9(j/) : [x,y) e M } , 

M = { (x,y) e IV' X BP : gj{x) + hj[y) < 0, j € Jo } 

with convex, continuously differentiable functions / ; R" — > g : R*" — > Pj : R" — > R* and 

hj : fV R^, V j 6 Jo, card(Jo) < +oo. This problem can be decomposed into the marginal 
functions 

ipj{y) = inf { f{x) : x £ Mj(y) } , 

Mj(y) = {x e BP : gj(x) + hj{y) <0, j e J } 
for arbitrary index sets J C Jq. Hereby the functions <fj{y) obviously fulfill the relations y>ji{y) < 
VjAy) V Ji £ J 2 ^ Jo. V 1/ € BP, and we have 

q* = inf { (fijAv) + q{y) ■■ y € dom((pjo) } , (3) 

y>Joiy) = max (pj{y) y € BP . (4) 

JCJo 

Obviously, the functions ip j are convex. Supposing a regularity condition (for instance, see Slater’s 
condition or MFCQ below) and boundedness of the optimal solution set 

M}(y) = { a: 6 Mj{y) : f(x) = q>j{y) } , 

then from parametric optimization we obtain that the subdifferential oi ip j can be calculated using 
the p-gradient of the Lagrange function 

Lj{x, y, X) = f(x) + Y, + hj{y)) . 



( 5 ) 
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d^j{y) = convj VyLj{x\y,\) = ^\jVhj{y) : A € Aj{x\y) | , 
Aj(x*, 1 /) = { (Xj)jej ■■ V/(a:*) + E Aj Vffj(x*) = 0 , > 0 V j e J, 

'• isj 

A,-te(x*) + ft,(j/)) = 0 Vjej}, 



( 6 ) 

(7) 



where x* is an arbitrary element of Mj{y) (cf., for instance, [3], [5], [6], [10]). 

Assuming the regularity condition mentioned above, the optimality property x* G Mj{y) is equivalent 
with the Karush-Kuhn-Tucker condition (KKT) 



V/(a;*)+EA,Vp,M=0, 

jeJ 

' 9j(x*) 4- hj{y) < 0, Xj >0, j e J, W 

, "b ^j{y)) ■ Aj = 0, j E J . 

Additionally results can be obtained considering the following assumptions: 



(MFCQ) We say the Mangasarian-Fromovitz constraint qualification holds at x* e Mj{y*) if there 
is a vector d G R” such that Vgj(x*)d < 0 V j G Jaj where Ja = { j ^ J ' 9j{^*) + hj{y*) = 0 } 
denotes the index set of active constraints. 



(SOC) The strong second order sufficient optimality condition holds at x* G Mj{y*) if for each 
A G Aj{x*,y*) and each vector d ^ 0 satisfying V9j{x*)d>= 0 V j G Ja the strict inequality 
d'^V^L(a;*,2/*,A)d>0holds. 

(CR) The constant rank constraint constraint qualification is fulfilled at x* G Mj{y*) if there exists 
an open neighborhood U of {x*,y*) such that for each arbitrary but fixed subset J' C J the 
set of gradients { V^(a:) , G J' } has constant rank for all {x^y) G U. 

Theorem 1 Consider the convex parametric programming problem (2) at (x,y) = {x*,y*) where x* 
is a optimal solution of (2) at the point y*. 

(i) If (MFCQ) and (SOC) are satisfied at this point, then there exist open neighborhoods U of y* 
and V of X* and a uniquely determined continuous function x : U V, such that x{y) is the unique 
optimal solution of (2) inV for all y E U. 

(ii) If in addition (CR) is fulfilled at (x*,y*) then x(-) defined in (i) is a PC^-function on an open 
neighborhood of y * . 

For the proof of these statements (for some more general problems) we refer to the literature (cf. 

[ 11 ], [ 12 ]). 

Property (ii) together with (4) gives us the local representation of the marginal function (fjoiy) as 
maximum of a finite number of convex differentiable functions: 

‘PJo iv) = . fi{y) , /i € C* , V y € (7(j/*) . (9) 

In general, the number K and the representation of the (locally defined) functions /» depend on the 
point y*, and no such representation of cpj^ is known in the neighborhood of points where (CR) is 
violated. 
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2. Maximum representation without CR: The onedimensional case 

Throughout this section it is assumed that the problem(s) (2) suffice the following general assump- 
tions: The functions f{x),gj{x) : R” and q{y),hj{y) : R^ -> R\ j G Jo, are convex and 

continuously differentiable on R^ or R\ respectively. For the constraints, a regularity condition is 
fulfilled (for instance, Slater’s condition), and the optimal solution set M}{y) is bounded for each 
y G dom((/?j). 

Under these assumptions, the subdifferential d(pj{y) of the extreme value function (pj{y) can be 
calculated as in formula (6), i.e., d(pj{y) is the image of the polyhedral set Aj{x*,y) from (7) via a 
linear mapping. From this we get the following result: 

Lemma 2 For each point y G int dom((^j) and each direction d G R^ d ^ 0, a set J' C J exists, 
such that the function (p j> (y) is differentiable at y = y and fulfills the properties 

^j{y) = o.nd ipj{y\ d) = (p'j,{y; d ) . 

Proof: Let x* G M*j{y) be an arbitrary fixed optimal solution of (2) at y = y. Furthermore, let A be 
an arbitrary vertex of the polyhedral set Kj{x*,y). Then we define the index set J' = { j e J : \j > 0} 
and consider the function (fj>{y) and the vector Aj/ (containing only the positive components of A). 
Since A being a vertex of Aj{x*,y) and 

Aj.{x\y) ^ Aj{x\y) n {A : A, = OJ ^ f} , 

the vector A j/ is also a vertex of Ajf{x*, y). Since all components of the multiplier \j> are positive and 
the set Aj>{x*,y) is bounded, from this we may conclude Aji{x*,y) = {Aj/}, and hence all gradients 
Vgj[x*), j £ J', are linearly independent (LICQ is fulfilled). 

Therefore, the set dy^j'(y) (which is the image of Aji{x*,y) via a linear mapping) is a singleton, too, 
and the function (pj>{y) is differentiable at y = y. 

Since the corresponding vectors x* and A j> remain being feasible in the Karush-Kuhn-Tucker con- 
dition (8) also with respect to the problem (pj>{y) (because of \j = 0 for j ^ J'), we have the 
equivalence p>j'{y) = P>j[y), too. 

For each direction d G d ^ 0, the directional derivative <^j(y; d) can be calculated via 

max (o,rf) . 

a^d(fj{y) 

This formula remains true if the polyhedral set d(pj{y) is substituted by the set of its vertices. This 

means that for fixed d there exists a vertex d of d(pj(y), for which (pj{y;d) = (a,d) holds. Since 

dpj{y) is a linear image of Aj(x*,y), there must be a vertex A of Aj( 2 ;*,y) for which the relation 

a = Y. ^jV/ij(y) holds. Selecting this vertex A for the determination of J' in the first part of the 

jeJ 

proof, we find a differentiable function pj>{y) with the subdifferential d(pj>{y) = {a}. Hence we have 
VJ'(y) = Vj{y) and d) = (a, d) = <p'j,(y; d). m 

The statement of Lemma 2 ensures that without supposing (CR) the maximum representation 

{pj^[y) = max pj[y) from (4) can be restricted to only those functions p>j{y), which are differ- 
J ^ J 0 

entiable (locally at y). For extending this result to the global case, first we have to consider a 
technique for piecewise continuation of differentiable convex functions. 
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Lemma 3 Let /i, f 2 '- ^ R^ he two convex differentiable functions and F{x) = msx{fi(x), / 2 (^)} 

the corresponding maximum function. Let I = (a, b) E R^ be a given nonempty interval and g{x) : 
cl(7) -> an additional convex function which is on I differentiable. Furthermore, assume that the 
relations 



g{a) = F{a), g'{a; 1) = F'(a; 1), g{b) = F(b), g\b; -1) = F'(6; -1), 



( 10 ) 



and g(x) > F(x) Vx e / hold. Then there exist two convex differentiable functions fi, h ■ Rf 

g{x), xel, 



for which the updated maximum function G(x) 
G{x) = max{/i(x),/ 2 (x)}. 






F{x), otherwise, 



can be represented as 



Proof: Since F{x) = max{/i(a:), / 2 (a:)} we have to consider two cases: If in (10) the function 

values and the directional derivatives of g{x) in both boundary points a and b coincide with the 
corresponding values of only one function fi^ (either io = 1 or zq = 2), then we can define 



( g{x), x^{a,b), 

\ /i„(x), otherwise, 



and / 3 _i„(x) =/ 3 _i„(x). 



Then, from (10) we can conclude that the function fioix) is convex and differentiably at the points a 
and b, too. Hence, the representation G{x) = max{fi{x), f 2 {x)} suffices the assertion of the lemma. 
In the remaining case (cf. Picture 1) we have without loss of generality 



g(a) = fi{a), g'{a; 1) = f[{a), g{b) = f^ib), g'{b; -1) = (11) 




Picture 1: Two active functions at a and b 

Then we set c = sup{o: : x < a, f 2 {x) > fi{x)} and d = inf{a; : a; > 6, fi{x) > /a (a:)}, which results 
in c < a < 6 < d, fi{x) > f 2 {x) Vx € (c, a) and f 2 (x) > fi{x) Vx G {b,d). If c > -oo or d < +oo, 
respectively, we have the additional properties 



/l(c)=/ 2 (c), /((C) > /'(C), /l(d)=/2(d), /((d) > /((d). 



( 12 ) 
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Using these notations now we define the functions 



/iW = 





X < a, 




f2(x), 


X < c. 


g{x), 


X € (a, 6), 


h(x) = ' 


conv{/i,/ 2 }(a;), 


X e (c, d). 


, f2(x), 


X >b, 




i hix), 


X > d. 



(13) 



Hereby conv{/i, / 2 }(o:) is the convex hull of fi{x) and f 2 {x) on the interval [c, d] (cf. theorem 5.6. in 
[15]), which is in the case under consideration a convex differentiable function on [c, d]. 

Since the functions fi{x) and f 2 {x) are piecewise defined, their continuity, diflFerentiability and con- 
vexity properties depend on the corresponding properties at the points a, 6, c and d. For fi (x) these 
relations follow immediately from relation (11). The analogous properties of f 2 (x) came from the 
construction of the points c and d, and the relations in (12). ■ 



Remark: The statement of Lemma 3 remains true, if it is applied to an arbitrary finite number of 
functions fi{x), i = 1, . . . , AT. Hereby it is essentially that the representations of F{x) and G{x) as 
maximum functions use the same number N of functions, i.e., the inclusion of g{x) does not increase 
the number of differentiable components within the maximum function. 



Theorem 4 Under the assumptions made above the representation 

V>Jo(v) = . max fi{y ) , /i 6 Vy € int dom(i^j„) , (14) 

with a finite number of convex differentiable functions fi(x), i = 1, . . . , AT, holds. 

Proof: Let k be the number of indices in the set J. In the case A; = 0 ( J = 0) the function (fj{y) 

is obviously constant, hence it is differentiable on R^. 

Now let us assume that for all functions (pj{y) with card(J) < k the statement of the theorem is 
true, i.e. there exists a finite number of differentiable functions fy\y) with 

¥>j{y)= (15) 

Then we consider an arbitrary fixed function (pj{y) with card( J) = A: + 1 and the corresponding set 
of exclusively active points 

Gj:= {y€ dom((pj) : ipj(y) > (pj^(y) V J' C J, J' ^ J } . 

Since all functions are continuous, Gj is an open set and can be represented as union of countable 
many disjunctive open intervals Tj = (aj,5i), i = 1,2,... Furthermore, because of Lemma 2 the 
function ^pj{y) is differentiable on Gj. 

The definition of Gj and relation (4) provide that (pj{y) = max ^j>{y) is true for all y i Gj. 
In this formula, we have card(J') < card(J) = A: + 1 such that (15) may be applied for each of the 
finitely many functions (fj>{y). Setting F[y) = ^ m^^^ max^^^^ fy'\y), after renumbering we have 

^j{y) = F{y)= ma.x fr(y) My^Gj 

r=\,...,N 

(with N < E N{J') < oo). 



(16) 
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Lemma 2 provides that for each of the intervals Tj = (a*, 6j), 2 = 1, 2, . . there exist indices k and u 
for which the relations 



‘fijidi) = /li(ai) , ip'j{oi; 1) = //.(oj) , ifij(bi) = and (p'j{bi\ -1) = -f^.{bi) 

hold. Hence we can apply Lemma 3 with respect to the two differentiable functions fi. and fn and 
the additional function (pj{y) which is defined and differentiable only on (aj,6i). In the result, after 
a renumbering and piecewise redefinition of the functions //. and /r, , respectively, we have two new 
differentiable functions fi- and /^ . , for which 

(fijiy) = max{ft,(y), fi,{y)} (17) 



holds for ye (a*, &j), too. 

After renaming fj := fj, j = U and j = r^, this procedure can be repeated with respect to the 
next interval Tj+i. Due to the proof of Lemma 3, the redefinition Jr{y) := conv { f fn}{y) occurs 
only in such an interval (c, d), where the function fr{y) is strongly less than the maximum function 
F{y). This fact and the relation Tj^ fl Tj^ = 0 for ji ^ j 2 ensure that the result remains true also 
after countable many steps accordingly to each of the intervals Ti,T 2 , . . . Hence (17) is valid for all 
i = 1,2,..., which gives us together with (16) the assertion of the Theorem. ■ 

Remark: In the multidimensional case ?/ € R^, p > 1, the result of theorem 4 gives us only a 

directionally representation of (fjoiy) via 

‘PJoiv + t-d)= t€R\ 

where the differentiable functions fr{t) depend on the point y e int dom((pjo) and the direction 
d € R^. On the other hand, until now no counterexamples are known for the validity of an analogous 
multidimensional representation (p{y) = fr{y)- But in opposition to the proof of Lemma 2, the 

latter proof of Theorem 4 does not seem to be generalizable for p > 1, because the areas Gj C R^ may 
have more than two nonempty boundary segments 5^ = {p € bd(Gj) : <pj(p) = fr{y)}i = 1, . . . , A^, 
and each segment Sr may consists of countable many disjunctive components. Then there is no 
similar way to rearrange the functions /r(p) without producing contradictions. 

Alternatively, in the following section we investigate two simple particular cases, for which well- 
known results from literature allow us to obtain the representation (p{y) = max^/r(p) via an 
explicit construction of the functions fr(y)- 

3. The multidimensional linear and quadratic case 

First we consider the case where all functions /, q, Qj and /ij, j e Jo, were affinlinear. Then, via 
applying an affinlinear transformation, the general extreme value function (1) can be rewritten in 
the form 

'Pl{v) = : Ax <y, x>0} , 

xcR 



( 18 ) 
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where ipiiv) depends on the parameter y € i4 is a fixed (m, n)-matrix and c € R” a fixed vector. 
For all parameters y from the solubility set A = {y E R"^ : |<^l( 2/)I < this linear program can 
be rewritten in its dual form 

^L(y) = sup {y^z} , Ml = {z : A~^ z > c, 2 ; > 0} . (19) 

z€Ml 

Since the feasible set Ml of this problem does not depend on the parameter t/, we can use a finite 
number of (extreme) points 2 ;* of Ml for representing the extreme value function in the form 

<PL{y) = , - (20) 

Hence, in the linear case (18) it turns out that the extreme value function (pL{y) is on the whole 
solubility set A the maximum of a finite number of linear functions fi(y) = y^z\ Taking into account 
that y is the image of an affinlinear mapping, the original extreme value function also turns out to 
be the maximum of a finite number of affinlinear functions on its effective domain (cf. [2]). 

As a second special case we consider the quadratic programming problem 

ifiQiy) = I ^x^Qx+p^x : Ax < y , x > 0 | , (21) 

where the extreme value function (pq^y) depends on the parameter y E R”*. We assume that Q is a 
symmetric and positive semidefinite (n, n)-matrix, A is a (m, n)-matrix and p € R”. 

With respect to the solubility set A= {y E R”* : \pq{y)\ < oo}, we have the following results from 
literature (for instance, cf. [1]): 

N 

Lemma 5 There exist a finite partitioning A = \J cl(Ar), where each of the local stability sets Ar 

r=l 

is the relative interior of a some convex polyhedral set with constant dimension dim(Ar) = dim(A), 
r = 1, . . . , iV. On each set cl(Ar) the extreme value function pq{y) is a convex quadratic function of 
the vector y, i.e., it can be represented as 

<^Q(y) = '^r(y) := ^y^0ry + yV+9r Vy€d(Ar), (22) 

where Qr, p^ and Qr are fixed matrices, vectors and scalars, respectively. 

Note that the quadratic functions pr{y) = may be easily expanded from y E cl(Ar) 

to all 2 / G R^, but in general the extreme value function pq{y) cannnot be represented as the maximum 
function max (Pr{y) for all y E A: 

r=l,...,N 

Example 6 Let (p{y) = min{a:^ ‘ x <2y,x < -y} = < ^ ^ cl(Ai), 

xeR" [ P 2 {y), y € c 1 (A 2 ). 

Here the quadratic functions pi(y) = 4?/^ and p 2 {y) = y^ are valid only on the local stability sets 
cl(Ai) = (—00,0] and cl(A 2 ) = [0, +oo), respectively. The extreme value function p{y) itself is on 
R^ convex and continuously differentiable, but it is neither the maximum or minimum of the local 
quadratic functions pi{y) and <^ 2 ( 2 /)- 
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To find a valid representation of ^Q{y) as the maximum of a finite number of differentiable functions 
we use instead of the quadratic continuation of ^r{y) a minimal convex extension function. 

Definition 7 Let f{x) he a convex differentiable function defined on a neighborhood of a nonempty 
closed convex set C C R”. Then we define the minimal convex extension function /c : R” -> of 
f with respect to C via 

fc(x) = sup { f{z) + V'^f(z) -{x-z) : zeC ] . (23) 

Remark: This definition of the minimal convex extension function fc is mostly analogously to those 
of the finite extensions of convex functions in [16], where under additional assumptions dom(/c) = R” 
is obtained. 

Lemma 8 Under the assumptions from Definition 1 the function fc{x) is convex on R*^, and the 
relation fc{^) = /(^) is valid for all x G C. Furthermore, for each convex function g : 
with g{x) = f{x) X e C the relation g(x) > fc{x) holds for all a; G R”. 

If in addition f{x) is a quadratic function defined on C, then the minimal convex extension fc{x) is 
differentiable on R*^. 

Proof: Since the linear functions lz{x) = f{z) + V^f{z) • {x - z), z e C, are convex on R”, the 

supremum sup lz{x) is also a convex function. Furthermore, we have 
zee 

epi(/) = (C X R') n f| epi(Z^) = (C x R‘) n epi(/c) 

zee 

and hence fc{^) = f(x)WxeC. 

Since g{x) is a convex function with g(x) = f{x) V a: G C, the graph of each function lz{x), z ^ C, 
is also a supporting hyperplane of epi(p). From this we get immediately epi(p) C epi(/c) and hence 
g{^) > fc{^) for all a; G R”. 

In the case f{x) = ^x^Qx p^x + q (with Q = we have 

fc{x) = sup I ^z'^Qz -f p^z + 9 + (p + Qzy '(x-z) ] 

= p^x + g + sup { (x- \z)^Qz } 

zee ^ ^ 

= p^x + q- inf { (^z - x^Qz } , 

i.e., /c(^) consists of an affinlinear part (p^x + q) and a infimum of the convex quadratic function 
Sx(^) = ( 2 ^ ~ x)^Qz with the parameter x in the linear part of Sx(z). Hence, results from convex 
parametric optimization (cf. [6]) can be applied to determine the subdifferential of fc(x): 

dfc(x) = p + {VxSx(^:) =Qz:ze Arginf^^c5x(^)} • 

Now we have to take into account that for each optimal solutions z^, ^ of the quadratic problem 

inf the relation Qz^ = holds (cf. theorem A. 4.3. in [1]). From this we get immediately 

zee 

that dfc(x) is a singleton and hence fc is differentiable. ■ 

In general, the function fc(x) is not differentiable as can be seen by the following example. 
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Example 9 Let C = {x e : \xi\ < 1, X 2 < 0} and f{x) = 



0 , 






^2 + kl| < 0, 

X2 - |a;i| > 0, 



|(|xi| H- X 2 )^, otherwise. 



Then f{x) is convex and continuously differentiable on but the function fc{x) fails to be differ- 
entiable at all points (0,X2) with X 2 > 0 (cf. Picture 2). 



£ 



3 

2 

fc(x) 

1 

0 



Picture 2: Example 



for f{x) and fc(x) 



Applying the minimal convex extension to the functions (Pr{y) now we get a valid representation of 
^q{v) ns maximum of finitely many differentiable functions. 





Theorem 10 For the optimal value function (fqiy) defined by (21) with a symmetric positive semi- 
definite matrix Q a finite number of differentiable functions fr{y) : R”^ R^ exist such that the 
representation (pn(y) = max fr{y) holds for all y £ A. 

r=l,...,N 

Proof: Let ^r(y) be the quadratic functions accordingly to (22). For each r = we 

construct the minimal convex extension function fr := (<^r)d(^ ) of Tr{y) with respect to the local 
stability set cl(>l^). Prom Lemma 8 then we get the convexity and differentiability of fr(y), and we 
have the relation fr{y) < g{y) for all convex functions g with g{y) = g?r{y) for y £ Ar- 
Since (fqiy) is a convex function with (22), we can choose g(y) = (fqiy) and obtain (pQ{y) > fr{y) 
for all y £ A. Hence, from (22) directly follows the statement (pqiy) = max^/r(y) for all y e A. m 

4. Conclusions 

It has been shown in this paper, that the optimal value function of type (1) with onedimensional 
parameter y £ R^ under weak assumptions can be represented as a maximum of a finite number 
of differentiable convex functions. The same result can be obtained directly for multidimensional 
parameters y G R”, if problem (1) is a linear or quadratic program. 

Such global representation (14) as maximum of differentiable functions ensures that the marginal 
function <^jo(2/) better properties than a general convex function, especially with respect to the 
set of points where ^jo{y) is not differentiable. 
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A Numerical Approach to Optimization w.r.t. 
Variable-Dependent Constraint-Indices 



W. Achtziger, University of Erlangen-Nuremberg, Erlangen, Germany 

Summary: We discuss the minimization of a continuous function on a subset of subject to 

a finite set of continuous constraints. At each point, a given set- valued map determines the subset 
of constraints considered at this point. After a brief discussion on the existence of solutions, the 
numerical treatment of the problem is considered. It is motivated why standard approaches generally 
fail. A method is proposed approximating the original problem by a standard one depending on a 
parameter. By choosing this parameter large enough, each solution to the approximating problem 
is a solution to the original one. In many applications, an upper bound for this parameter can be 
computed, thus yielding the equivalence of the original problem to a standard optimization problem. 
The proposed method is applied to the problem of optimally designing a loaded truss subject to local 
buckling conditions. 

1. Introduction and problem statement 

The modelling of applied optimization problems often leads to structures which are more complicated 
than considered in standard optimization problems. By a “standard optimization problem” we mean 
here the minimization of an objective function / : iR” — > M under a given set of constraints. In 
some models, however, the set of constraints depends on the variable. 

The general problem discussed in this paper has the following mathematical structure: We consider 



functions /, ^i, . ■ . ,^m • X — > M defined on a (non-empty) set X C 


Moreover, a set-valued 


mapping J : X — > Vm is given where Vm denotes the power set of {1, . 
With these data we define the minimization problem 


Vm := 


min fix) 


(Pi) 


s.t. Qiix) < 0 for all i € J[x), 




xeX . 




The interesting feature of (Pi) lies in the given set- valued map J which allows the set of constraints 
to vary with x. Here the set J[x) may change at each x e X, and may also be empty at some x. 
Note that it may happen that gi{x) > 0 for some i ^ J{x) though x e X itself is feasible for (Pi). 
Problem (Pi) reduces to a standard problem if J(x) is constant for all x e X. Throughout the paper 


we make the following “general assumptions”: 




/ and ^ 1 , . . . , Pm sire continuous on X] 


(GAl) 


X is closed in JR”; 


(GA2) 


(Pi) possesses a feasible point; 


(GA3) 



Apart from (GA2) X may be arbitrary, e.g., X = iR", or X is a discrete set, X c In applications, 
X typically represents additional constraints as given in standard problems by functions hj : iR” — > 
iR, j = 1, . . . , A:, A: + 1, . . . , AT, in the form 



X = {xeM^ I hj{x) < 0 Vj = 1, . . . , A:, hj(x) = 0 Vj = A; + 1, . . . , K} . 



( 1 ) 
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If each function hj is continuous then X is closed as required in (GA2). 

The feasible set of (Pi) is denoted by 

X\\= {x ^ X \ gi{x) < 0 for all i € J{x)) . 

Throughout the text we fix some feasible point (note (GA3)), say x ^ X\. 

Example 1 We consider the set X \= {x e IB? \ X\ > 0, X2 >0, X\ + X2 < 2}, and m ;= 1 
constraint function gi : X — > iR, gi{x) Xi-{-X2 — l. The mapping J : X — y V\ is given by 



J{x) := 



{1} if xi = 0 or X 2 = 0, 
0 otherwise. 



For Xi = 0 or a ;2 = 0 the constraint g\{x) < 0 leads to the feasible set 

Xi=X\{ (0,af, («,0f |Ka<2} 



of (Pi). In particular, note that the vertices (0, 2)^ and (2,0)^ of X are not feasible (i.e., ^ Xi). □ 



2. Existence of solutions and a practical condition 

We start with a brief discussion on the existence of solutions to (Pi). By “solutions” we mean here 
and in the sequel a global minimizer of / on the feasible set Xi of (Pi). Instead of considering the 
whole feasible set Xi it suffices to restrict to an appropriate level set of /. We take the point x from 
above, and put 

Xi{x) :={xeXi\ f{x) < f{x ) } . 

In standard problems, i.e., if J is constant on X, the existence of a solution to (Pi) is guaranteed 
if X\(x) is bounded: Then Xi{x) is compact due to (GAl) and (GA2). In the general situation, 
however, boundedness of Xi{x) is not sufficient as Ex. 1 shows: E.g., the function f{x) := —xi does 
not attain its infimum function value —2 on the feasible set (cf. (3)). Here, the main hindrance for 

the existence of a solution is obviously that the feasible set Xi is not closed. Speaking in terms of 

topology, the existence of solutions heavily depends on continuity properties of the set-valued map 
J. Such considerations are made in many books on topology as, e.g., in [6]. Our main intention is 
the numerical treatment of (Pi), and thus we restrict ourselves to problems for which J is given in 
a form that can be numerically used. In the sequel we consider maps J represented by “indicator 
functions” Zi, i = 1, . . . , m, in the following way: 

There exist continuous functions zi,. . .,Zm ‘ X — > ]R such that 

{ Zi{x) = 0 for all X e X with i e J{x), (Az) 

Zi{x) > 0 for all a; € A with i ^ J[x). 

This condition is purely motivated by practical applications where functions Z{ with (Az) are directly 
given by the modelling (e.g., cf. Sec. 4). Assumption (Az) means that J(x) can be written as 



J(x) = G {1, . . . , m} 1 Zi{x) = o} for all x e X. 



(4) 
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One should note that the construction of functions Zi with this property is straightforward while 
the main assumption hidden in (A 2 :) is the continuity of the Zi’. It is easy to prove that (A 2 ;) 
implies the upper semi-continuity of the set- valued map J. Unfortunately, lower semi-continuity 
is needed to imply that Xi{x) is closed (cf. [6]). In general, however, this is not the case as Ex. 1 
shows: The infimum is not attained for f{x) := —xi (cf. above), but (Az) is satisfied, e.g., for 
zi{x) := min{a;i,X2}. 

On the other hand, the mathematical structure (4) of J opens (Pi) to a simple numerical approach 
(cf. Sec. 3) which finds plenty of applications. For this purpose, we have in mind that the Z{ are 
“simple” such as linear functions or the pointwise minimum or maximum of linear functions. 

To enforce the existence of a solution, let us consider a sharpening of (A 2 ;) which, again, is motivated 
by practical applications: In the sequel, we sometimes assume the following condition: 

There exists e > 0 such that x € X and i ^ J[x) implies Zi{x) > e. (Ae) 

At a first glance, it seems that this assumption contradicts the continuity of the functions Z{ in 
(A 2 :). This, however, is not the case if X is partitioned into disconnected sets (cf. below). We note, 
however, that this assumption is not essential for the numerical approach presented below in Sec. 3. 
Here (Ae) is considered just as an alternative to the property that Xi{x) is closed (cf. Cor. 2 below) 
because in certain applications the functions Zi can be chosen in a way such that (Ae) is easier to 
verify than the property that Xi{x) is closed. 

It is easy to prove that the assumptions (Az) and (Ae) imply the lower semi-continuity of J. By this it 
can be shown [6] that Xi (and thus Xi{x)) is closed. Together with some boundedness argument this 
finally assures the existence of a solution to (Pi). In order to keep the presentation self-contained, we 
go here a different way: First we clarify the meaning of assumption (Az). The existence of solutions 
to (Pi) is then a simple conclusion (cf. Cor. 2 below). With the notation 

Xj{I) :={xeX\ J(x) = 1} for I eVm 

one can prove (solely by the use of standard arguments from continuous optimization) that (Ae) is 
almost equivalent to the closedness of the sets Xj{I): 

Proposition 1 Let (Az) be satisfied. If condition (Ae) holds then Xj{I) is closed for all I e Vm- 
Vice versa, if X is bounded and if Xj{I) is closed for all I € Vm then (Ae) holds. 

(Simple examples show that the boundedness of X is essential in the second assertion.) 

The following corollary is an immediate consequence of Prop. 1 and (GAl): 

Corollary 2 If ( Az) and ( Ae) are satisfied then Xi (x) is closed. If, moreover, (x) is bounded then 
(Pi) possesses a solution. 

Proposition 1 shows that (Ae) is a strong condition: Since each of the sets Xj{I) is closed, they 
must be disconnected. Therefore, the set X falls apart into the (closed) sets Xj{I) on which J( . ) 
is constant. Hence, problem (Pi) can be split into a collection of problems of standard type where 
for each I € Vm the function / is minimized on the set 

Xi{x \ I) := {x e X \ f{x) < f{x), gi{x) <0 'ii € I, Zi(x) = 0 '^i e I, Zi{x) > e Vi ^ /}. 
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Therefore, if X\{x) is bounded, each of these standard problems possesses a solution provided 
Xi{x I /) ^ 0, and each x* € argmin{/(a:^) | / € Vm) is a solution to (Pi). Unfortunately, this 
result does not suggest a numerical approach for solving (Pi) in practice: The number of standard 
problems to be checked is \Vm\ = where in applications often m > 100 (cf., e.g.. Sec. 4). 

3. A numerical approach by successive approximation 

In this section we investigate a general approach for the numerical treatment of (Pi). We assume that 
J is represented by functions 2 ;*, i.e., we assume (Az) to hold (but not necessarily (Ae)). The main 
idea for a numerical approach is the approximation of (Pi) by a problem (P 2 {P}) which depends on 
a parameter P > 0 indicating the “degree of approximation”. The approximating problem (P 2 {P}) 
should have the following properties (A) and (B): 

(A) : For fixed P, (P 2 {P}) is a continuous optimization problem in standard form with a solution x^ . 

(B) : For P — > -hoo (or, for P — > 0) a subsequence of the solutions x^ converges to a minimizer x* 
of (Pi) (whenever such x* exists). 

As a first candidate for such a procedure the well known penalty technique comes in mind which 
penalizes the violation of “difficult” constraints by adding a positive term (“penalty”) to the objective 
function (cf., e.g., [4, Ch. 9]). In our situation the “difficult” constraints are the inequalities gi{x) < 0 
for all i e J{x), i.e., for x 6 X we should add a penalty term to f{x) whenever gi{x) > 0 for some 
i ^ J[x). With a glance on (Az), one could think, e.g., of an auxiliary problem (P 2 {P}) such as 

min$oo(^) with $oo(^) •= {/(^) + P • max max{pj(x), 0}) 

xGX i:zi{x)=0 * 

where P > 0 is a large parameter (parallel to the ^ 00 -penalty in Nonlinear Programming [4]). 

For any choice of a penalty function, however, the auxiliary function ^ can be discontinuous (de- 
stroying any standard numerical optimization approach). The reason is that gi{x) may stay positive 
independently from Zi{x) which may be zero or positive (cf. (A 2 :)). For example, consider Ex. I with 
Xi from (3): The penalty must be zero on Xi, but it must be positive, e.g., at (0, 2)^. 

The continuity of the auxiliary function $ and the property of the penalty to be zero at x if and only 
if X is feasible in the original problem are of course undeniable requirements of any penalty method 
(cf., e.g., [4]). Similar arguments hold for the use of barrier methods. 

Another attempt to numerically solve (Pi) with the issues (A) and (B) from above is (what we call 
here) the “perturbation technique”. This technique is frequently used in Structural Optimization: 
One perturbs the original problem (Pi) by slightly modifying the constraints depending on a (small) 
parameter F > 0 such that the perturbed problem (P 2 {P}) coincides with the original problem (Pi) 
for P = 0. The perturbation is chosen such that (A) is satisfied. It is well known from the field 
of Parametric Optimization that the perturbations must be “properly” chosen to assure (B). To see 
this, we only add a small example: 

Example 2 Consider the objective function f{x) := I — (x 2 + 2)l yjx\ + (x 2 + 2)^ for X and J from 
Ex. I. With s := (0, -2)^, this function simply minimizes the angle between the vectors {x — s) and 
(0, 1)^. Hence (note (3)), x* is a solution to (Pi) if and only if x* = (0, a)^ with 0 < a < I. For 




87 



fixed 0 < P < 1 put X{P} := {x e X \ Xi > P, X2 > P}, and consider the modified problem 

^mm}/(x) (P,{P» 

s.t. gi{x) <0 if 1 6 J{x). 

For P = 0 problem (P 2 {P}) becomes (Pi)=(P2{0}). Since J(x) = 0 for all rr > 0 (cf. (2)), for 
P > 0 problem (P 2 {P}) reduces to the minimization of / on X{P}, and hence possesses the unique 
minimizer x^ := (P, 2 — P)^. Hence, for P \ 0+ the optimizers x^ converge to (0, 2)^. This point, 
however, is neither optimal (cf. above) nor feasible (cf. (3)) for (Pi)=(P2{0}). □ 

Now we propose a numerical approach that satisfies the issues (A) and (B). By the above we see that 
the objective function should not be modified for treating the difficulties. Further, it is (generally) 
also not a good idea to exclude feasible boundary points from the feasible set. These observations 
lead to the following perturbation approach: We consider the problem 



mm /(a:) (P 2 [P]) 

S.t. gi{x) - P • Zi(x) < 0 for all z = 1, . . . , m, 
xeX, 

where P > 0 is a fixed parameter (Here and in the sequel we use the notation (P 2 [P]) instead 
of (P 2 {P}) to avoid confusions with the problems defined above). Problem (P 2 [P]) satisfies the 
requirement (A). Analogously to A'l from above, we denote the feasible set of (P 2 [P]) by 

X 2 {P) := {x e X \ gi{x) - P • Zi{x) < 0 for all z = 1, . . . , m} . 



Moreover, for x £ X we introduce the notation 

P(x) := max ( max , o) (5) 

where the convention max0 = — oo is used. The following proposition clarifies the role of P. Its 
proof is straightforward only using the basic properties of the functions Zi given in (A 2 :). 



Proposition 3 Let (Az) be satisfied. 

(a) //O < Pi < P 2 then A' 2 (Pi) C A^ 2 (P 2 ) C 

(b) Let X £ Xi. Then x £ ^^(P) if and only if P > P{x). 



Summarizing, the feasible set of (P 2 [P]) is always contained in that of (Pi), and each x feasible for 
(Pi) is also feasible for (P 2 [P]) for large enough P. We note that the feasible set X 2 {P) of (P 2 [P]) 
is non-empty for all P > P(x) by (GAS) and by Prop. 3(b). 

The following theorem clarifies the existence of solutions to (P 2 [P]). It also makes transparent the 
numerical approach of solving (Pi) via (P 2 [P]). As ususal, “min(P)” denotes the minimum objective 
function value of a problem (P), and analogously “inf(P)” for the infimum. 



Theorem 4 Let {Az) be satisfied, and assume that Xi{x) is bounded. Then the following holds: 

(a) For each P > P{x) problem (P 2 [P]) possesses a solution. All solutions are contained in Xi{x). 

(b) For each P > P{x) the inequalities -00 < inf (Pi) < min(P 2 [P]) hold. 

(c) For P — > -f- 00 , the optimal function value o/(P 2 [P]) converges monotonically decreasing to the 
infimum function value of {Fi), i.e., min(P 2 [P]) \ inf (Pi) as P — > -foo. 
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The proof of this theorem is first grade analysis. Note that the existence of a solution to (Pi) has 
not been assumed in Thm. 4. Thus also the closedness of M\{x) (or the stronger condition (Ae)) is 
not needed to achieve the infimum objective function value of (Pi) via (P 2 [P]). 

Let us now discuss issue (B) from above. The following theorem shows that (P 2 [-P]) realizes an 
“exact” approach: Instead of letting P tend to +oo it suffices to consider P which is large enough: 

Theorem 5 Let (Az) be satisfied, and let (Pi) possess a solution x*. Then for each P > P{x*) the 
problem (P 2 [P]) possesses a solution. Moreover, each solution to (P 2 [P]) is also a solution to (Pi). 

The proof of this theorem simply relies on Prop. 3. It shows that (Pi) and (P 2 [P]) for any fixed 
P > P{x*) can be regarded as “equivalent” for practical purposes if only one and not all solutions 
to (Pi) must be computed (This enables the derivation of optimality conditions for problem (Pi): 
Simply write down appropriate optimality conditions for (P 2 [T^]) where P > P{x*)). Note again that 
the technical (strong) condition (Ae) is not needed for Thm. 5 to hold. 

^From a formal point of view, Thm. 5 parallels results of exact penalties in Nonlinear Optimization: 
For these penalty functions, a lower bound P^*^ can be given such that each solution x^^^ to any 
penalty problem with P > P^*^ is an exact solution to the original problem. Formulas for P^*^ 
usually depend on the Lagrange multipliers at some optimal solution to the original problem (e.g. 
[4, Thm. 9.3.1]). For the majority of practical applications, however, these multipliers cannot be 
estimated in advance, and the exactness property is almost useless for the practical solution. On 
a first view, this objection also holds for the formula of P{x*) (cf. (5)). However, contrary to 
Lagrangian multipliers, in many applied problems it is easy to derive upper bounds for P{x*) which 
then can substitute P{x*) (cf. Thm. 5). For simplicity, we only state a result where (Ae) is satisfied 
to illustrate the practical use of Thm. 5: 

Corollary 6 Let {Az), and (Ac) be satisfied, and let Xi{x) be bounded. Moreover, let constants 
ai e IR, i = 1, . . . ,m, be given such that a{ > max | sup (a:) | x G A'i(x), Zi{x) > e}, o}. Then 
(Pi) possesses a solution, and P** := (maxiai)/c has the properties of P{x*) in Thm. 5. 

In many applied problems each pi is bounded on X (or on A'i(x)), or even X is bounded (or Xi{x)). 
In these cases, often very simple calculations suffice to derive numbers Oj, z = 1, . . . , m, with 

ai > max{ maxpj(x), o|, resp. with Oj > max{ max gi{x), o| . (6) 

x£X J '' xGcl(A'i(x)) J 

It is trivial to see that these relations imply the inequalities in Cor. 6 (and thus a parameter P**). 
It remains to discuss the numerical treatment of (P 2 [Pj). Obviously, this question depends on further 
particular properties of /, pi, Z{, and the set X. As already indicated above, the functions Zi should 
be chosen “simple”, i.e., the Zi should be “as simply tractable as the pi are”. For example, if / and 
the pi are linear, and X is given by (1) with linear functions hj then (P 2 [F^j) is a (standard) linear 
programming problem if the Zi are linear as well. If (Pi) is a “generalized convex problem” (i.e., 
/, Pi, hj are convex, and equality constraint functions hj are affine linear) then, if possible, each Zi 
should be chosen as a concave function in order to preserve the convex problem structure for (P 2 [Pj). 
In general, however, (P 2 [T’j) is a global (i.e., non-convex) optimization problem. Such a situation is 
considered in the following section dealing with a real life example from Structural Engineering. 
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Figure 1: (a) Ground structure; (b) Solution structure [“realistic” buckling]; (c) Solution structure 
[“simple” buckling] ;(d) Part of ground structure; (e) Scenario I; (f) Scenario II; 



4. An application: optimal truss design including buckling conditions 

The computation of optimal truss designs is one of the classical problems in the field of Structural 
Optimization. A truss is a pin-jointed framework in two dimensions (=2D) or in 3D which is con- 
structed from bar elements. For simplicity, we consider the particular example of designing a 2D 
cantilever arm. Figure 1(a) illustrates the given “design space”, i.e., the preliminary given data: We 
consider a rectangular grid of 11 x 3 so-called “nodal points”, and M = 90 connections (edges) called 
“potential bars”. The triangles at the left hand nodes indicate that these nodes are fixed (e.g., to 
some rigid wall) while all other nodes are allowed to make small movements ( “displacements” ) when 
the structure (yet to be designed) is under load. Per free node we have two nodal displacement 
directions yielding the dimension AT := 60 of the system of free nodal coordinates. The nodes and 
bars form a so-called “ground structure” (cf. [5]). The task is to optimize the cross-sectional areas 
ai of the potential bars i = 1, . . . , M w.r.t. given material properties (same material for all bars), 
and such that a given load / can be carried. In Fig. 1(a) the given load is indicated by an arrow 
and applies at the bottom right hand node pulling vertically to the ground. In the system of nodal 
coordinates this / is given as a negative unit- vector / = — G IR^ (where j corresponds to the 
vertical free nodal displacement direction of the bottom right hand node). 

As the objective function we consider the total volume (resp. weight) of the structure which is 
minimized. Besides the cross-sectional area Uj > 0 of the i-th potential bar we consider the internal 
force Qi acting along this bar as an auxiliary variable. The length of the i-th potential bar is denoted 
by ii. 

Our main interest is the inclusion of “realistic” buckling constraints: Consider some bar of length 
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with cross-sectional area a* > 0. We assume that the geometry of its cross-sectional area is invariant 
under scaling (e.g., is a square). Then there exists a constant s* > 0 such that the buckling member 
force computed by Euler’s formula 



buckling _ 

9 * - 



( 7 ) 



Therefore, to prevent buckling of this bar, one has to require that (where < 0 means 

compression). For simplicity, we assume that the geometry of the cross-sectional area is the same 
for all bars, i.e., that s := si = • • • = sm- Since we allow some/many of the bars to get zero 
cross-sectional areas (“topology problem”) it is easily seen that formula (7) hides a discontinuity: 
The length 4 depends on the considered design a because it may happen that several potential bars 
must be considered as a “unified long bar” in the structure: To see this, consider Fig. 1(d) showing 
a detail sector of the ground structure of Fig. 1(a). We see 15 potential bars (numbers in circles). 
First, consider a design a and forces q with ai = • • • = 03 > 0 and 04 = • • • = ais = 0. Then 
the real truss on the shown part of the ground structure looks as depicted in Fig. 1(e): We have to 
consider a “unified long bar” being the union of the (potential) bars z = 1, 2, 3 “lying on a line” and 
having (the same) cross-sectional area ai > 0. All (potential) bars connected to the “intermediate 
nodes” (between the bars 1 2, and 2^3, respectively) from outside the unified bar possess zero 

cross-sections, i.e., Ei=4<^t = 0* By force equilibrium it is easy to see that qi = q 2 = q^ holds. Thus, 
for checking feasibility of (a, g) w.r.t. buckling in this (“unified”) bar, we get the condition 



Here the value in (7) is the sum ^1 + ^2 + ^3 because the “unified bar” consists of the potential 
bars z = 1, 2, 3 (and thus has total length ^1 + ^2 + ^3)- 

Second, consider a slightly modified couple (a', q') where a[ = = 03 > 0 and a'^ = a'5 = = Ug = 

a'lo = a[i = a’i 2 = clu = 0 (as above), and where o!j = a'^ = > 0. The situation is shown in 

Fig. 1(f). In this case, the potential bars 1 and 2 form a “unified bar” while bar 3 forms some other 
(separate) bar. We get the two buckling constraints (cf. (7)) 



9'i>- 



(^1 + ^2) 



r(a'i)^ and 9^ > -^(4) 



This shows that “realistic” buckling constraints to be considered at some point (a, q) depend on the 
design a. Obviously, the modelling of this difficulty is highly complex: All situations in the ground 
structure must be covered wherever potential bars lie on a line (cf. Fig. 1(d): bars 4 ^ 12, or 5 ^ 11, 
etc.). This general difficulty is well known among scientists [5, 8]. A computational model, however, 
still seems to be unknown. In the following paragraph we try to state such a model based on a 
combinatorial approach (for details cf. [1]): 

Consider all sequences of potential bars in the ground structure which lie on a line, and which are 
connected by common end nodes. Such a sequence is called a “chain” , and its nodal points connecting 
two bars are called “intermediate nodes (of this chain)” (as above). For example, c := (1,2,3) in 
Fig. 1(d) is a chain. By definition, let also each single potential bar form a chain (without an 
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intermediate node). The set of chains in some given ground structure is denoted by C. For the 
ground structure in Fig. 1(a) we get \C\ = 253 chains (after the deletion of symmetric ones). The 
length i{c) of each chain c e C is defined by the sum of its member lengths (e.g., for c = (1,2,3), 
£{c) := + ^2 + 4)- Moreover, for each c e C, we define a vector r(c) G {0, 1}^ such that for any 

design a > 0 the term r(c)^a is the sum of the cross-sectional areas of those bars which are connected 
“from outside” the chain c to intermediate nodes of this chain (e.g., r{c)i := 1 for z = 4, . . . , 15, and 
r{c)i := 0 otherwise; r{c)'^a = Sji 4 aj). Therefore, the bars in some chain c must be treated as a 
long unified bar if and only if r(c)^a = 0. If we collect the bar indices of a chain c = (zi, Z 2 , . . . , z/b) 
in the set B{c) := {zi,Z 2 , • • . ,2*:} then (7) leads to the “realistic buckling constraints” 



for all c e C with r(c)^a = 0 and all i e B{c) . 



These constraints are part of the following problem which in the sequel is referred to as (Ti): 



M 

min Lai 

a,qelRM “ 



s.t. Bq-\- f = 0, 



—aai < Qi < aai for all z = 1, . . . , M, 

Carea — d for all Z — 1 , . . . , , 

—Qi - ^ all c G C with r(c)^a = 0 and all z G B{c), 

-ai <0 for all z = 1, . . . , M. 



( 9 ) 

(10) 

( 11 ) 

( 12 ) 

( 13 ) 



Equilibrium of external forces / and member forces q is modelled by (9) where B G contains 

the geometry information on the ground structure such that qi <0 (resp. qi > 0) denotes compression 
(resp. tension) of the z-th bar. The constraints (10) bound the absolute stresses \qi/ai\ (if ai > 
0) where cr > 0 is a given constant. The conditions (11) model construction laws (“slenderness 
conditions”) reducing a possible failure of the truss: If the z-th potential bar is contained in the 
(real) structure (i.e., if Oj > 0) then a^ > Carea must hold where Carea is a specified minimum area 
^area > 0. In addition, formulation (11) takes care of the case Uj = 0. 

One has to check that this model problem coincides with reality, i.e., indeed leads to scenarios as 
indicated in the Figs. 1(e) and (f). For these proofs the explicit structure of B must be used (cf. [1]). 
With the notations n := 2M, m := |{(c, z) | c G C, z G B{c)}\, 



q) 

J{a,q) 



£(c)2“* 

{ceC\ r(cfa = 0} 



for all c G C and all z G B(c)j 



for all (a,q) € X — where X contains all couples (a,q) satisfying the constraints (9), (10), (11), 
and (13) — (Ti) is a problem of the type (Pi). For the ground structure in Fig. 1(a) it can be 
shown (with standard settings for a and s; cf. [2]) that (GA3) is satisfied. Moreover, (GAl) and 
(GA2) hold, and the feasible level set is bounded (cf. [1]). The construction of the functions Zi in 
(Az) is straightforward: Simply define the linear(!) functions Z(c,i)(a,q) := r(c)^a for all c G C and 
all z G B(c). Due to the slenderness constraints, also condition (Ae) is satisfied with e := €area (If 
r(c)^a > 0 then aj > 0 for some j. By (11) it is aj > Carea? and hence r(c)^a > Carea)- 
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Summarizing, Cor. 2 guarantees that (Ti) possesses a solution. It turns out that (Ti) generally does 
not possess a solution if the constraints (11) are omitted. Also the idea of a perturbation approach 
working with positive lower bounds on the bar areas generally fails (for small examples cf. [1]). 

For the numerical treatment of (Ti) we use the approach from Sec. 3. In the sequel the problem 
corresponding to (P 2 [-P]) is called (T 2 [P]). From Thm. 5 we deduce that each solution to (T 2 [P]) is 
a solution to (Ti) if P is large enough. By simple calculations, constants o:(c,t) can be derived which 
satisfy the right hand inequality in (6). Thus, Cor. 6 tells that it suffices to solve the problem (T 2 [P**]) 
where P** := (max(c,j) Q^(c,i) )/^area- Due to the negative quadratic terms in the bar areas, this problem 
is not convex. On the other hand, mixed terms (like ajUj, i j) are not present. Therefore, it was 
profitable to create an adapted optimization algorithm which uses this “almost linear” problem 
structure. Details on this algorithm can be found in [2]. It represents an SLP-method of feasible 
descent. Figure 1(b) shows the solution structure a* computed by this SLP-technique: The values 
aj > 0 are visualized by the diameters of the shown bars. Black indicates tensile members while grey 
means compression. A careful investigation of this particular structure shows that a* (together with 
the computed forces q*) is indeed a global minimizer to (Ti). 

Classical problem formulations in areas and forces including buckling conditions are close to (Ti), 
however, working with the “simple” constraints Qi > —Siof/lf for alH = 1, . . . ,M instead of (12) 
(cf., e.g., [3, 7]). A (scaled) solution structure a* to this problem is depicted in Fig. 1(c). Obviously, 
the design a* is fundamentally different from d*. As argued [5, 8] this proves that the consideration 
of the classical ( “simple” ) buckling conditions may be not sufficient in a pre-optimizing state of the 
design process to get a good design of a structure satisfying “realistic” buckling conditions. 
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Minimization of Maximal Torque of a Flywheel Robot 



P. Flisberg, T. Larsson, P.O. Lindberg, S. Grahn, G. Johansson, 
Linkoping University 

Summary: This paper treats the optimization of the design and control of the Linkoping Flywheel 
Robot with respect to maximal torque. The robot consists of three links, where the inner rotates 
with constant angular velocity to handle major displacements and the two outer are used for local 
positioning. The objective is to minimize the maximal torque in the two outer joints, subject to 
the constraints that the Tool Center Point is still at the gripping and placing operations. The link 
lengths, the points of time for the pick-and-place operations, and the trajectory of the manipulator 
are variable. The link movements as functions of time are expressed by B-splines. Two different 
solution approaches are developed and tested in experiments. In the first, the maximal torque is 
approximated using an integral and the constraints are handled via a quadratic penalty function. 
In the second approach, an augmented Lagrangean reformulation is done; we use exponential and 
quadratic penalty functions, respectively. Our numerical results are better than those obtained earlier 
with less advanced methods. 

1. Introduction 

The handling capacity of robots is of great industrial importance. A case in question is the handling 
of light weight objects on conveyor belts where the flow of objects significantly exceeds the capacity 
of one robot. Such applications arise in food industries with high production rates and relatively 
light products; prospective robots are then required to have low cycle times. The difficulty with 
rapid motions, however, is the great power needed to accelerate the robot manipulator. 

The Linkoping Flywheel Robot [7], which was developed at the Division of Assembly Technology 
at Linkoping University, is intended to be used for rapid movements of light objects. [Other robot 
concepts developed for this purpose are the SCARA (Selective Compliance Assembly Robot Arm) 
robot [4](p. 268) and the Delta project [3].] The robot has three links. It picks objects on one side of 
the robot and places them on the other side (see Figure 1). The three links are moving horizontally. 
The most important difference compared to conventional robots is that the manipulator arm is in 
motion when the Tool Center Point (TCP), which is the gripping part of the robot, is still (relative 
to the base of the manipulator) in the picking and placing operations. 

The mechanical concept for this robot is a hybrid of a light and a robust structure. The inner link 
is designed to be robust, and not necessarily light; it might instead be beneficial with a relatively 
large mass and inertia since that would give a more stable run. This link will work as a flywheel, 
rotating with constant angular velocity, and it takes care of major displacements of the objects to 
be moved. The fine displacement are handled by the two outer links. They also allow the TCP to be 
held still at pick-and-place operations. The outer links are light compared with the inner one, and 
are both allowed to rotate freely in both directions. One cycle of manipulator movement consists of 
four phases: 



Phase 1: hold the TCP still for approximately 0.05 seconds at the picking position. 
Phase 2: move the TCP to the placing position. 
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Figure 1: The Linkoping Flywheel Robot seen from above. 

Phase 3: hold the TCP still for approximately 0.05 seconds at the placing position, and 
Phase 4: move the TCP to the picking position. 

The basic design objective is to reduce the accelerated mass (active mass and inertia) of the links of 
the robot. It is essential to reduce the weight of the robot since there is a correlation between weight 
and cost. It is further advantageous to concentrate on minimizing the maximal torque needed to 
perform an operation since this will decide the size of links, gears, etc. Low maximal torque values 
will also decrease wear and tear of the motor. Minimizing the maximal torque will therefore keep 
down the cost and size of the robot parts. Because of the large moment of inertia for Link 1, and 
since it rotates with constant angular velocity, the value of the torque is low for Joint 1. 



2. Problem formulation 

Given the vector Q = (^ 1 ,^ 2 , ^ 3 ) of relative angles of the different joints (see Figure 2), and link 
lengths I = (I 1 J 2 J 3 ), the position P{6\1) of the TCP can be evaluated as 




We further let the relative angles be functions of time, that is ^ = 6{t). In Figure 2 the robot 
manipulator is shown at three different points of time. One cycle takes 0.5 seconds and this time 
interval is denoted by T. Further, 0i{t) = An. The points of time when the picking (at position 
Ap) and placing (at position Ad) operations begin are denoted by tp and respectively. The times 
when the picking and placing should begin are not settled. The only requirement is that the TCP 
must be fixed at the proper locations (at Ap and at Ad) for at least 0.05 seconds each. Knowing the 
configuration of the links when Phase 1 or Phase 3 of the cycle begins, the movements during the 
remainder of that phase are uniquely determined by the motion equation 



P{0(t)\l) = Am, t e [tm,<m + 0.05], m = p,d. 



( 2 ) 
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Figure 2: Link positions at three points of time. 

Introducing ri 2 {t) and ri 3 (t) to be the torques at Joints 2 and 3, respectively, the design problem is to 
minimize the maximal values of these torques during a whole cycle, given a specific picking position 
i4p, and a specific placing position Ad. The link lengths, the angular functions, and the picking and 
placing times, tp and td, respectively, are to be determined. 

The optimization problem under consideration is 

( 2 ). 

Given the derivatives 6{t) and 6{t), the torques 7i2{t) and ns{t) can be computed through a triangular 
nonlinear system, the Newton-Euler equations [4](p. 200). It should be noted that these torques 
can be computed in a direct way (without iterative loops) from 0{t) and its derivatives. First, link 
velocities and accelerations are computed recursively from Link 1 out to Link 3. Second, forces, 
torques of interaction and joint actuator torques are computed from Link 3 back to Link 1. As in 
[5], point masses are assumed to be located at the ends of the links, with the mass 10 kg for Link 1 
and 1.25 kg for each of the outer links. 

Grahn and Johansson considered the problem P in [5]. To describe the angular function 6 they use 
fifth degree polynomials. At the picking and placing operations they derive the values d(t) analytically 
(assuming h = h), and to make 6 continuously differentiable they have boundary conditions on the 
polynomials. To minimize the maximal torque under these constraints on the polynomials they use 
the minimax routine in MATLAB, see [9], which uses a Sequential Quadratic Programming method 
[2]. This will be referred to as the SQP-method. 

3. Statement of an approximate problem 

The problem P is defined over an infinite-dimensional space of angular functions, and it contains an 
infinite number of constraints. To be able to treat it numerically, it is approximated in two ways. 
First, the infinite-dimensional function space is replaced by a finite-dimensional subspace, which is 
spanned by B-splines. Second, time is discretized. 
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We will express the angles %(t), j = 2,3, in terms of (third degree) B-splines [ 8 ], which 

are twice continuously differentiable, see Figure 3. Due to the rotational movement, it is, however, 
beneficial to introduce a basic constant rotation, the deviation from which is expressed by the B- 
splines. Hence, we have that 

9 

= Qjt + i = 2, 3 (3) 

t=-3 



in terms of control variables aij. It turns out that a good basic rotation is given by qj = (-l)J+i 87 r, 
j = 2,3. Each B-spline has support in a compact interval [^,,^,+ 4 ], where the U:s are the nodes of 
the splines. A consequence of this is that the Hessian matrix of the torques, etc., with respect to the 
variables will be banded, which is advantageous. 



m) 

A 







Figure 3: B-splines. 

Due to the cyclic nature of the motion we impose the requirement that the initial and final states of 
the manipulator shall be the same. To achieve that, we introduce the constraints 

0>—3j — 2 j “ 3>nd 0,—lj — CLgj^ j — 2, 3, 

which makes 



0}{t\o.) = Qjt + arjBi^it) + OijBiiit) + 09 ^Sii(t) + aojB^(f) + • • • + ogjBl{t), j = 2, 3. (4) 

The functions 9 and 0 will be lower order polynomials. 

A disadvantage of the B-spline representation is that we can not achieve that the TCP is exactly 
still in the Phases 1 and 3 of the cycle. In practice, it is, however, not necessary for the TCP to be 
exactly still when the picking and placing operations are performed. For a pick-and-place robot in 
the food industry an error of 1 mm is often acceptable. We therefore introduce a tolerance, €, which 
defines the maximal allowed distance between the position of the TCP and the picking and placing 
positions Ap and Ad during Phases 1 and 3, respectively. 

We could alternatively compute the movements in the picking and placing phases according to the 
Equation ( 1 ), but we would then have to impose constraint on the weights of the splines in order to 
make 0 continuously differentiable at the change to spline control. 

The position of the TCP as function of the variables a^, the link lengths, and the time is denoted 
P{t\a,l). It is defined by Equation ( 1 ), and by the definition of 6{t\a) given by Equation ( 4 ). 
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Introducing tf = (tp,td) and ^(a, /) = 
tion is 



\rij{t\a^ /)|, the optimization problem under considera- 

t£T,j =2,3 



(P$) min ^(a, /) 

s.t. ||p(tla, 1) - < e, te [tm, tm + 0.05], m = p,d, (5) 

which has 25 continuous variables. (To simplify the presentation, some trivial restrictions have here 
been suppressed.) Problem contains constraints corresponding to two (a priori unknown) subsets 
of the whole time interval T. (Recall that the points of time tp and U are variables.) It might be 
approximated as 

{Pt) min^(a, /) 

a,l,tf 

S.t. T4„(t,U(||.P(«|a,0->lml|^-e) <0, t G T, m = p,d, (6) 

where T^p {t, tp) is a twice continuously differentiable function of the shape which is shown in Figure 4; 
the function TAd(^, td) is defined analogously. The nonlinear parts of these functions are built up by 
third degree polynomials. The formulation Pt is a minimax problem, with a finite number of outer 




tp tp -f 0.05 



Figure 4: The function TAj,{t,tp). 

variables, a, /, and t/, and a continuum of inner variables, rij{t). Due to the maximum operation the 
function ^ typically is nondifferentiable. (Hence, standard gradient-based optimization techniques 
cannot be used.) Furthermore, the problem Pt is, in general, non-convex. 

To obtain a finite set of constraints, the problem is time discretized with time intervals of 0.005 
seconds; the points of time are denoted r = [ti, where K = 100. 

4. Mathematical optimization approaches 

We test two different approaches to handle the minimax objective function. The first is a penalty 
approach, and the second utilizes augmented Lagrangean functions. 

4.1 Penalty approach 

We here approximate the nonsmooth function ^ with the smooth function 

$^(a,0 = -in 

M \j=2 k=l 



( 7 ) 
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As /i -> 00 , this approximation becomes more accurate, but at the same time more ill-conditioned. 
In order to obtain an unconstrained problem, we include the constraints into the objective by means 
of a quadratic differentiable penalty function. With 






( 8 ) 



where r = 1 and r = 2 correspond to the two coordinate directions, we obtain the problem 

2 K 

{Pp) min “ $^(fl,/) + C ^ ^ iQri'T’kWi ^m)) ? 

m—p^d r=l fc=l 

where c is a penalty parameter, which should have a large value to make the constraints satisfied. 
To overcome the ill-conditioning for large values of /z and c, we apply a second-order method for 
the minimization of /, t/). (It might seem futile to attempt second order methods, given the 
complexity of our problem. But due to the representation of 9 through B-splines, and since each 
B-spline function is nonzero only on a bounded interval, the Hessian of the smoothened function 
$^(o, /) and also of will be banded, and computable at reasonable cost.) 

As noted, the torques at the joints are calculated through a sequence of involved computations. How- 
ever, since the computations are arranged in a triangular way, we may use automatic differentiation 
to determine first- and second-order derivatives at low additional computational cost [10]. To do 
this, we use the package ADOL-C [6]. 

Newton’s method is used to solve Pp, but since the objective function is non-convex, we include 
Levenberg-Marquardt’s modification [l](p. 98) to ascertain that descent directions are obtained. For 
the line searches we use the Armijo step length rule [l](p. 25). 

Most of the computing time is spent on the second-order derivatives. Therefore we also tried to 
use conjugate gradients, where second derivatives are not needed. However, this led to much longer 
solution times, probably due to the ill-conditioning. 

In order to determine the value of c needed to make the constraints fulfilled with sufficient accuracy, 
we solve Pp for increasing values of c. In each main iteration, s, we perform ten Newton iterations 
to get an approximate solution to Pp for c = c®; the approximate solution is denoted (a*,/^tj). If 
the violation of the constraints has not decreased enough since the last update of the penalty, it is 
increased, i.e. 






if Y. EZl5'-('ntK,i‘,OI >»? T, EZk<-(rit|a’ \1‘ 

m—p,d r=l k=l m=p,d r=l k=l 

& otherwise. 



( 9 ) 



where /? = 5 and rj = 0.25, see [l](p. 347). 

The overall solution method described above is referred to as the PM-method. 



4.2 Augmented Lagrangean approach 

In the second approach we rewrite the problem as 
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(Pl) min(fi 

a,i,tf 

s.t. \rij{Tk\a,l)\ - ^ <0, j = 2,3, k = \,...,K (10) 

Tx„(rifc,U(|(mKO-^m)J-e) < 0, k = l,...,K, r = l,2, m = p,d.{ll) 

We use augmented Lagrangean reformulations with respect to all constraints, and with exponential 
or quadratic penalty functions. Let A and 7 be vectors of multipliers accociated with the Constraints 
(10) and (11), respectively. (There are 600 multipliers totally.) With 

hj{Tk\a, 1) = \Tij{Tk\a, l)\-<j>, ( 12 ) 



the augmented Lagrangean function with exponential penalty is 

{Pl.) Ll(a, t/, A, 7 ) = 0 + — E E A^* - l) + 

j=2k=l 

- E EE7mr*(6-^*^<^‘i“'''‘")-i). 

^62 m=p,dr-lk=l 

Using the notation [•]+ = max{0, •}, the augmented Lagrangean function with quadratic penalty is 
{Pl<i) Lg(a, /, t/, a, 7 ) = 0 + 01+ ~ "h 

j=2 k=l 

2 K 

^ E EE {[TmrA: + Cq, 2 pr(' 7 fc|u, /, ^m)]^. Tmrfc} • 

m=p4r=lk=l 

The augmented Lagrangean subproblem for Pl^ is, for given vectors A and 7 , 



{Pl-) minL^(a,/,t/,A, 7 ), 

and the subproblem Pi^ for Pl« is defined analogously. 

For the augmented Lagrangean functions, as well as for the penalty function z^^c{o>i fh® Hessian 
will be banded but not positive definite. Therefore, Levenberg-Marquardt’s modified Newton method 
is used to solve Pl« and P^. Also for these subproblems the package ADOL-C is used to evaluate 
the first- and second-order derivatives. The Armijo rule is used to determine the steplength, and ten 
Newton iterations are made to get an approximate solution to a subproblem for given values of the 
multipliers and the penalty parameters. The solution is denoted as in Subsection 4.1. 

As proposed in [l](p. 347), in the exponential case, the multipliers are updated according to 






and in the quadratic case, according to 

and 7 ^+^ = [ 7 ^^* + c*^Sr(r*|o’,r,C)]^ • 

After updating the multipliers, each of the two penalty parameters is increased if the total violation 
in the corresponding set of constraints has not decreased enough over the previous minimization; 
cf. the penalty update (9). In the case of the exponential penalty function, we used ^ = 1.2 and 
rj = 0.25, and for the quadratic case, p = b and rj = 0.25. The solution approach described above 
are referred to as the ALE and ALQ methods, for the exponential and quadratic case, respectively. 
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5. Experimental results, analyses and conclusions 



A good starting solution seems to be needed for the algorithms to arrive at solutions giving a smooth 
motion and low torques. At the initiation, the weights for the B-splines are generated so that the 
movement is smooth. Starting values for the multipliers in the augmented Lagrangean approach 
are obtained by inspection of the torques generated in the starting solution. The multipliers Xjk 
which are associated with large absolute values of nj{rk) are given large values, and vice versa. The 
multipliers for the positioning constraints are handled in the same way. 

For the exponential penalty function we could not use the updating formula stated above, because 
then the method became unstable. The method becomes stable if we instead update the multipliers 
as 



= max 






AjV 






= ‘fnax 






Tmrk^ 






where p = 10 and S = 0.0000025. 

We terminate the algorithm when the constraints are almost satisfied, i.e. 



• for the penalty approach, when 



E IZ||[^'>■(T|o^^^4)]+|| < £2, 



(13) 



• for the augmented Lagrangean reformulations, when 



H IZ||[S'r(r|a’,/’,4)^|| < £2 and 



E||[/‘l(r|a»,n]+||<e3 

3=2 



(14) 



where €2 = ea = 0.0001. (In principle, the complementary slackness conditions should also be- 
come satisfied, but this is very difficult to achieve since the augmented Lagrangean subproblems 
are not solved to exact optimality.) 



In order to enable a comparison of our results with those reached by the SQP-method presented 
in [5], we have used the same picking and placing locations as in that work; the picking location is 
(0.4m, 0.4m) and the placing location is (-0.4m, Om), relative to the base of the robot. One difficulty 
in comparing them is that the problem statements are not exactly the same. Grahn et al. use a 
method which makes the TCP stand exactly still in Phases 1 and 3; we instead use an approximate 
requirement for this positioning. We investigate the effect of restricting the outer links to have the 
same lengths by optimizing both with and without this restriction. In Table 1, the ALQd and the 
PMd methods are the methods ALQ and PM, respectively, applied to the problem P$ without the 
length restriction. In the other cases the restriction I 2 = I 3 is included. 

With our methods we get a 26% lower maximal torque compared to the results from the SQP-method 
proposed by Grahn et al. [5] when we allow a maximal tolerance of 1 mm in Phases 1 and 3, see 
Table 1. When, however, the tolerance is 0.1 mm, then the result gets much worse. This probably 
depends on that the remaining degree of freedom for the B-spline weights is quite small whence 
the picking and placing constraints have become fulfilled with sufficient accuracy. A way to resolve 
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Method 


SQP 


PM 


ALE 


ALQ 


PM 


ALQ 


PMi 


ALQi 


€ [mm] 


0 


1 


1 


1 


0.1 


0.1 


1 


1 


Iterations, s 


- 


11 


35 


28 


30 


46 


10 


31 


max{rij(r)} 


32.0 


23.59 


23.59 


23.59 


78.2 


78.2 


21.77 


21.96 



Table 1: Numerical results 

this weakness would probably be to divide the cycle into more intervals for the splines, with the 
disadvantage that we get more variables. 

The computing times are in the range 0.5 to 1.5 hours on a Sun Ultra 2 workstation. The Figures 5 
and 6 show some characteristics of the solution obtained with the PM method for e = 1 mm. 




Figure 5: The torques U 2 {t) and riz{t) as functions of time during the cycle. 




Figure 6: Link positions at seven points of time, and the movement of the TCP. 

The PM method seems to be preferable since it reaches the same result as ALE and ALQ, but in less 
number of iterations. However, the number of iterations needed to achieve a near-optimal solution 
varies with the choice of parameters. The PM method is also the most stable with respect to the 
choice of parameter values, while the ALE method is the less stable in this sense. 

It may be noted that the maximal value of the torques during a cycle can be reduced by over 7% if 
the length restriction is withdrawn. 
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We conclude by noting that the presented solution algorithms are quite general. They are usable 
for similar robots with some minor changes concerning the evaluation of the torques. The solution 
algorithms can, for example, be applied to a robot with three links and where the outer joint is 
prismatic, to a SCARA robot [4](p. 268), or to a robot with two links, where the joint between the 
links is prismatic. 

6. Further research 

We will implement a more advanced robot model, which includes a better description of the moment 
of inertia, and we will also study similar robot concepts. It is also of interest to study how the 
maximal torque changes with the picking position, and what the optimal link lengths would be if the 
picking position varies in an area instead of always being the same. Another interesting topic would 
be to study the minimization of the weighted sum of the maximal torque and intermittent work. 
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A Phase- 1- Algorithm for Interior-Point-Methods: 
Worst-Case and Average-Case Behaviour 



P. Huhn, University of Augsburg 

Summary: We discuss a phase- 1-algorithm for linear programming problems (LPs) and are in- 

terested in its worst-case and its average-case behaviour under the rotation-symmetry model. This 
stochastic model was the basis of the average case analysis of the simplex method in [1]. 

A phase- 1-algorithm shall provide a starting point for an interior point method (IPM), if the LP 
is feasible and bounded, or otherwise indicate the infeasibility resp. unboundedness of the LP. Our 
phase- 1-algorithm is a barrier-function method, adapted for approximating the analytic center of a 
polyhedron. 



1. Introduction 

1.1 Basic Notations for LPs 

We look at linear programming problems of the form: 

maximize v^x subject to a^o: < 1, . . . , a^x < 1 , 

where v,x,ai, . . . ,am € and m > n, m,n E IN. For abbreviation we use the matrix := 
(ai, . . . , am) and e := (1, . . . , 1) for the vector of all ones. Sometimes we write e(m) to emphasize the 
dimension of e € IR"*. We call v^x the objective function^ and Xp = {x\ Ax < e} the feasible region. 
The interior of Xp will be denoted by Int Ap, the boundary of Ap by dXp. Using slack variables 
we can formulate (P) as maxu^a: s.t. Aa; -f s = e, s > 0 (s € and the feasible region Xp as 
{(a:, s) G | Aa; -H s = e, s > 0}. 

To avoid numerous case-studies and to make a probabilistic evaluation possible, we agree in the 
following assumption of nondegeneracy: 

Each subset of n elements out of {ai, . . . , Om, v} is linearly independent and each 
subset of (n -f 1) elements out of {ai, . . . , a^n, —v} is in general position. 

We will see, that this assumption does in no way influence the results of our study. 

Remark 1 (P) provides the origin 0 to be - in any case - a feasible, interior point of Xp. Because 
of m > n the assumption of nondegeneracy provides Xp to be pointed. If Xp is unbounded, then 
there exists a direction d G IR" \ {0} such that Ad < 0, and because of nondegeneracy there is even 
a direction d with Ad < 0. 

With every LP there is another LP, called the dual problem. The dual problem of (P) is: 



minimize e^y subject to A^y — v, y >0. 

The feasible region of (D) will be denoted by Xd = {y e IR"* | A^y = v,y>0}. Prom duality theory 
we know, that Xp ^ (h Xd is bounded. I. e. the problem (D) cannot be unbounded because there 
is a primal feasible point. 
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1.2 Barrier Function and Analytic Center 

For a point x € Int(A'p) denote the harrier function of Xp by 

m m 

•AW := -Hln(l - afx) = (2) 

t=l t=l 

Lemma 2 (Properties of the barrier function 0) 

1. 0 is strictly convex in IntXp and = oo. 

2. If Xp is unbounded, then inf^eintXp (!>{^) = — oo. 

3. If Xp is hounded, then (j>{x) is bounded from below and attains its minimum in a unique point 
Xa ElntXp. This point is called the analytic center of Xp. 

If Xp is bounded, we want to approximate the analytic center and we have to measure the closeness 
of a point x £ Xp to the analytic center Xa> Therefore, we introduce 

g(x) = V(/)(x) = A^S~^e (gradient) , H{x) = V^(/)(x) = A^S~^A (Hessian) and (3) 
p{x) = —H{x)~^g{x) = —{A^S~^A)~^A'^S~^e (Newton direction), (4) 

where S = diag{s) = diag{e - Ax) (diagonal matrix). Now, we define a measure S for the distance 
of an interior point to the analytic center as the Hessian norm of the Newton direction p a,t x: 

S{x) ;= ||p(x)|U(x) = s/e^S~^A(ATS-^A)-^ATS-^e = ||5-'Ap(a:)|l. (5) 

II . II denotes the Euclidean norm. Note, that (J(x) = 0 implies g{x) = 0 and thus x = Xa- An 
equivalent definition is (5(a;) = miny{||5y-e|| | A'^y = 0, y G IR""} = ||5y(s)-e||, where the minimizing 
y is given by y{s) := + S~^Ap(x) = 5“^(/ - 5"M(A^5"2A)"M^5"^)e. A point x G IntXp 

with <5(a;) = a < 1 is called an approximate analytic center. For a more detailed discussion see 
[2] and [5]. 

1.3 Phase- 1 Problem and Phase- 1 Barrier Function 

For our phase-1 algorithm we do not consider the polyhedron Xp, but a slightly different polyhedron: 

Xp := {x G IR” \Ax<e, —v^x < 1}. 

Remark 3 Note, Xp C Xp and 0 G IntXp. Again, Xp may be unbounded. If Xp is unbounded, 
then there exists a direction d with v'^d > 0 (and Ad <0) and thus, the objective function of (P) is 
not bounded from above on Xp and on Xp, too. 

We use as slack variables s = e- Ax and = 1 + v'^x. The barrier function associated with X), 
is 

m m+l 

4>i(x) := -5^1n(l - ajx) - ln(l +v^a;) = - Insj (6) 

i=l t=l 

and we use := (s^, Sm+i), •= (y ^7 Vm+i) and A' := (A^, —v) to formulate the gradient, Hessian 
matrix and the Newton direction of (/>i at x analogously to (3)-(4) as 

Pi(x) = V0i(x) = = V^(x) = A'^(5')-M' and pi(x) = -H,{x)-^g^{x). (7) 
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Figure 1: 

These fig\ires show a bounded 
resp. an unbounded polyhe- 
dron Xp (white and gray) and 
the corresponding phase- 1 poly- 
hedron Xp (white). The addi- 
tional constraint < 1 in 

the definition of Xp cuts off the 
part colored grey. 



Lemma 2 holds for (^i and Xp similarly. If Xp is bounded, we denote the analytic center of Xp by 
x'^. The measure for the distance of an interior point to the analytic center is defined by 

<5i(x) := l|pi(x)|U.{.) = ||(5')-M'piW||. (8) 

Again, we can formulate 6i as di(x) = miny/{||5V - ^(m+i)|| I = 0, ?/' € = ||5V(s') - 

e(m+i)ll with 2/'(s') := (5')"^e(^+i) -f A' pi(x) , 

1.4 Complexity Theory 

For deriving worst case complexity results we use a discrete complexity model, where only rational 
data are admitted. In this context an algorithm is said to be polynomial if the computational effort 
can be bounded from above, polynomially in the encoding length L of the specific problem instance. 
The worst-case complexity analysis of our algorithm makes use of the following facts (compare [4]): 

Lemma 4 1. Each vertex of Xp is contained in a sphere of radius (centered at 0). 

2. If there exists an optimal solution to (P), then there exists one with \\xapt\\ < 

Corollary 5 The norm of any vertex of Xp is at most y/n 2 ^^^^ . If there exists a point x € Int X p 
with ||x[| > y/n2^^^\ then the objective function v'^x o/(P) is unbounded on Xp (and Xp). 

Remark 6 The bound y/n2^^^"^ for the norm of vertices is a worst-case bound. If we look at a 
specific problem instance of (P) resp. Xp the bound may be less than y/n2^^^\ 

2. A Phase- 1- Algorithm 

2.1 Properties of the Barrier Function 

We will start by presenting some fundamental lemmas, which explain the complexity of the algo- 
rithm. These lemmas are the result of adapting barrier function methods to the phase- 1 problem 
for approximating the analytic center of X), (see [2], [5]). We will write pi,gi,Hi and Si instead of 
Pi{x), gi{x), Hi{x) and (^i(x), whenever there is no doubt about the reference point x. 

Lemma 7 1. For x e X}> and d G IR” with ||d||//i(x) < 1 lye have x -\- d e IntX),. 

2. For all Si>0 we have (t>i{x) - <l)i{x + api) > Si - ln(l + Si), where a = (1 -h 

3. If Si < 1, then (j)i{x) > — 00 for all x G Xp, i. e. Xp is bounded. 
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Lemma 8 Let T £ JR,T > 0 be fixed. The barrier function (x) is bounded from below on the set 
Xp n {x G IR” I ||x|| < T], i. e. for all x £ X}, fl {x € IR” | ||x|| < T} we have 

m 

4>i{x) > -Eln(l + hi\\T) - ln(l + IkllT) =: Li(T). 

t=l 

Now, we define T{Xp) as the maximum of the length of the vertices of Xp: 

T{Xp) := Max{||x|| | x is a vertex of Xp}. (9) 

The calculation of T{Xp) is as difficult as solving the LP (P), so we will not calculate this figure, 
but we can use it for theoretical considerations and for the probabilistic analysis. 

Remark 9 We know, that Xp C {x|||x|| < T(Xp)}, if Xp is bounded. Otherwise, if Xp is un- 
bounded, then there exists a point x £ Xp with ||x|| > 

2.2 Algorithm and Complexity Analysis 

As our phase- 1 algorithm shall determine whether Xp is unbounded or not, we can explore the iterates 
resp. their norms on being greater or smaller than T(Xp). Unfortunately, we do not know the explicit 
value of T(Xp). But - given any x £ Xp - we can try to find a vertex x of Xp with ||x|| > ||x||. If 
no such vertex exists, then Xp is unbounded. This ’test’ can be done by Procedure 1, where we 
use the following notations: denotes the z-th row of A', then /(x) := {i \ a'^x = 1} is the index 

set of constraints active at x. For an index set I = {ii,. . .,ij} C {1, . . . , m -h 1} {j > 1) we define 
:= (ttii, . . .,ai.) and the projection onto the null space of A'j by Pj := E — A'j^(A'jA'j^)~^A'j . 

Procedure 1 Test; 

Input: X £ IntXp 

Initialization: xq := x; I := 0; I{xi) := 0; := x\; 

repeat 

if A'di < 0 then return true (exit); {Xp is unbounded in the direction di} 
else ai := max{o: | a > 0, x/ -h adi £ Xp}; [maximal step length} 

xi+i := xi + aidi; 

/,+i = P/,^. :=E- dM := ^ + I-' 

endif 

until I = n; 

return false; {x„ is a vertex of Xp with ||x/|| > |lxo||} 

Output: false: there is a vertex of Xp with norm greater than ||x|| or true: Xp is unbounded. 
Theorem 10 (Complexity and Correctness of Procedure 1) 

1. Procedure 1 terminates after at most n iterations and the effort of each iteration is at most 0{mn). 

2. For all n > I > 0 we have xi^i £ Xp and ||x/+i|| > ||x/||. 

3. If Procedure 1 returns true, then Xp is unbounded. 

4 . If Procedure 1 returns false, then x„ is a vertex of Xp with ll*n|| > ||xol|. 
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If we invoke Procedure 1 at a point x € Xp with l|a;|| > T(Xp), then Test(x) returns true; and now 
we can formulate the phase- 1 algorithm to find an approximate analytic center of Xp or to state 
that Xp is unbounded. 

Algorithm 2 Phase-1 Algorithm; 

Input: a; G IntXp 

Initialization: xq := x; k := 0; 6i{xk); 
while 6i(xk) > 1 and TEST(a;ib) = false do 
Pi{xk) := -Hi{xk)~'^gi(xk); d* := (1 + 

Xk+i ■■= Xk+akPi(xk); 

(5i(xt+i); Test(x*+i); k := k + 1; {Update} 
endwhile 

Output Xk with Si{xk) < I or TEST(xjfc) = true. 

Theorem 11 (Complexity and Correctness of Algorithm 2) 

1. After k iterations the value of barrier function is less than —0.3k. 

2. After Kt := 0.3~^lLi(T)| iterations we have ||x/Cy|l > T. 

3. Algorithm 2 terminates after at most 0(lLi(T(Xp))|) iterations, i. e. after at most 0{mL) 
iterations in the worst case (using the complexity model of paragraph 1.4)- 

4 . The effort of each iteration is at most 0{mn^). 

5. If Algorithm 2 terminates with TEST{xk) = true, then the phase- 1 polyhedron Xp is unbounded 
and the objective function v'^x of problem (P) is unbounded on Xp. 

6. If Algorithm 2 terminates with Si(xk) < I, then Xk is an approximate analytic center of Xp. 








Figure 2: 

These figures illustrate the behaviour 
of the phase- 1 algorithm. You see the 
polyhedrons Xp from the previous fig- 
ure and the balls {x \ |lx|| < T(Xp)} . 
The left figure shows a sequence of it- 
erates, which converges to the analytic 
center Xa. The right figure shows, that 
the algorithm finds a point G Xp 
with llxjfcll > T(Xp) and stops. 



2.3 Construction of a Starting Point for an IPM 

Our original goal was to find a starting point for an IPM to solve (P). But, up to now we only got 
an approximate analytic center of Xp. Before we construct a starting point, we want to show how 
to improve the proximity of an approximate analytic center. 

Lemma 12 If 6\{x) < \, then x^ := x-\-pi{x) G IntXp and Si{x+) < 
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If we once have x G IntXp with (Ji(a:) < 1, we can calculate a ’better’ approximation of the analytic 
center using pi{x). Lemma 12 implies, that these iterates converge quadratically to the analytic 
center. So, w.l.o.g. we assume to have x e IntXp with = a < 1 for some fixed a € (0, 1). The 
parameter a has to be chosen according to the starting conditions of the specific IPM. 

Here, we want to discuss the typical starting condition when we use a barrier function method as in 
[2]. We only employ the IPM to solve (P), if the phase-1 algorithm has stopped with < 1 and 
we know, that (P) is bounded. In this case the barrier function (/)p{x,fi) defined by 

'T ^ 

4>p{x,n) := ^ln(l -afx), 

P 1=1 

achieves its minimum over Xp at a unique point x(/z), where // is a positive parameter. x{fi) is 
characterized by the KKT conditions: Ax -h s = e, s > 0, A^y = v, y > 0, Sy = pe^m)- 
A measure for the distance of an interior feasible point x to the minimum point x(/i) is 

S{x,p) := \\p{x,fjL)\\H{x,n), 

where p[x,p) denotes the Newton direction of 0p(a;,/i) at x^ i. e. p{x,p) = —H(x,p)~^g{x,p), 
H[x,p) =A^S~^A denotes the Hessian matrix and g{x,p) =-^v-\-A^S~^e the gradient. Again, we 
can formulate (J(x, p) in different ways using y{s, p) = S~^e -I- S~‘^Ap{x^ p): 

6{x,p) = ||5"Up(x,/x)|| = - e\\\A^y = v,ye IR'”} = _ e|| , (iq) 

As we are looking for a starting point, we need a point x and a parameter p with S{x,p) = /? < 1 

for some ^ € (0, 1). 

Theorem 13 For given x G IntXp with (5i(a:) = a < 1, = (s^, Sm+i) y'{s')^ = ym+i) = 

i(S')~'^e(m+i) + define x := x, s := s, y := y~\iy ^^d p := 

Then we have x G IntXp, y G IntX£> and for some specific a,/3 e (0, 1) we have 5{x,p) = /?<!. 

3. Probabilistic Analysis 

We have seen that the (worst case) upper bound on the number of iterations of the phase- 1 algo- 
rithm depends on the encoding length L. But practical experience suggests that this upper bound 
overestimates the usual number of iterations. To explain this effect in a convincing theoretical way, 
we will carry out a probabilistic analysis. 

3.1 The Rotation-Symmetry-Model 

For calculating the average number of iterations (or related expectation values) one has to make 
assumptions on the distribution of the data of (P). As mentioned before, we will base our probabilistic 
analysis on the Rotation-Symmetry-Model, which demands that 

• • • j flmj ^ are distributed identically, independently and symmetrically 
under rotations on IR” \ {0}. 

We specialize this model here to the uniform distribution on the unit sphere ojn in IR” and refer to it 
as um-RSM. Note, that this model gives nondegeneracy the probability 1 and thus, degeneracy can 
be ignored in our analysis. 
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3.2 The Average Number of Iterations (Part 1) 

With these probabilistic settings we can consider T{Xp) and Li{T(Xp)) as random variables de- 
pending on Oi, . . . ,am,v. Prom Theorem 11 (part 3) we know that \Li{T{Xp))\ is essentially the 
problem-specific number of iterations of Algorithm 2 and we can get the average number of iterations 
of Algorithm 2 by calculating E[\Li{T{Xp))\]. As the figure Li{T{Xp)) is quite intricate we will 
look on Li{T{X},)) := (m + 1) ln(l -f T{Xp)), which is an upper bound on \Li{T{Xp))\ because of 
||t;|| = ||ai|| = 1 for alH = 1, . . . , m in the um-RSM. It follows that 

E[\L,(T{X},))\] < E[L,(T(Xl.))]. (11) 

The expectation value is evaluated with respect to the distribution of T{Xp). So, we are interested 
in this distribution function, which will be denoted by Fp. But, again an exact derivation Fp is quite 
complicated and therefore we will obtain an approximate distribution function F with the following 
property: F{t) < Fp{t) Vi G (0, oo). 

The distribution F puts more weight on the great values of t than the exact distribution Fp and this 
variation of weights leads to 



Ep^nxm < E^[L^(T(X}.))], (12) 

where Epj,[.] is the expectation value with respect to the exact distribution function Fp and E^;[.] 
is the expectation value with respect to the approximate distribution function F. 

3.3 An Approximate Distribution Function 

First, we look at the exact distribution function, i. e. the probability that T{Xp) is less than t. 

Fp{t) = P{T{Xp) < t) = P{\\x\\ < Fi veTtices X of Xp) 

= 1 — P (there exists a vertex x of Xp with Ikll > t)- (13) 

If we can find an upper bound on P (there exists a vertex x of Xp with l|o:|| > t), we get an lower 
bound on Fp{t) and can use this lower bound to define the approximate distribution function F. 
For the evaluation of the probability it is standard to use the polar interpretation as in [1] and [3]. 
The advantage is that the random events can be explained directly by use of the random input 
vectors ai, . . . ,am,CLm+i '= The use of am+i instead of -v will simplify the coming notations 
and formulas. Here, we will not elaborate this polar interpretation, because it would require many 
additional (technical) terms and definitions. Nevertheless, we want to give the reader an idea, how 
to approximate the distribution function and therefore we introduce the following notations: 

Each vertex x of Xp is the unique (because of nondegeneracy) solution of a system a^ix = 1, 
. . . , a^nX = 1 with A = {A\ . . . , A”} C {1, . . . , m + 1} and it has to satisfy afx<l for all 
iG{l,...,m + l}. A is the index set of active constraints (at x). To distinguish between different 
vertices, we will label a vertex with its index set and denote it by x^. 

Given n linearly independent points xi,. . . ,Xn G we denote the hyperplane through Xi, . . . , 
by H{xi, . . . , Xn), the distance of H(xi, . . . , a;„) to the origin by h{xi, . . . , x^), and by H~{xi, . . . , x„) 
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we denote the halfspace bounded by H{xi,. . . ,Xn) and containing the origin. Using these notations 
we have the following relation: is a vertex of X}> ^ aj £ H~{ai, i € A) Vj € {1, . . . , m -f 1}. 

Moreover, is normal to the hyperplane H{ai,i e A) and \\xa\\ = l/h{aij e A). 

In the RSM the probability that a point x is lying in the halfspace € A) can be described 

by a function G : [-1, 1] [0, 1] which depends on the height h{ai,i G A) (compare [1]). Given the 

unz-RSM the explicit formula for this distribution function G is given (for /i(aj, i € A) >0) by 



P{x e e A)) 



G{h{aiJe A)) = 1 - 



An-2(^n-l) 

>^n-l(^^n) 






In order to develop an integral formula for the probability 



P(there exists a vertex x of Xp with ||x|| > t) (14) 

we will write F(aj) for the distribution function of the vectors a^, i = 1, . . . , m + 1 and we will use 
indicator functions !{.} to formulate the event mentioned in {.}. For further considerations we 
remark, that all vertices x satisfies ||a;|| > 1 and therefore we can assume t to be greater than 1 and 
set Frit) = P{T(X},) <t) = 0 V t < 1. 

So, we can start and formulate resp. approximate the probability in (14) as follows: 



P(there exists a vertex x with ||a;|| > t) = 

f (m+l) 



nm+L) 

~ J ^{there exists a vertex x with ||x|| > ■ j ®m+i)dP(ai) • • • dF(afn^i) 

R" 

/-(m+l) 

— ^ J is a vertex with HxaH > • • • > ^m+i)dF(ai) • • • dF(am+i) 



without loss of generality we can assume A = {1, . . . , n} 
n 



_ /m + l\ 

~ \ n ) J is a vertex}(^*i5 • • • )<^rrH-i)l{||a:^|| > ^)(t, Ui, . . . ,an)dP(ai) • • •dP(a^+i). 



We have 



/ (m-n+l) 

l(a:A is a vertex} (^i) • • • ? ^n)<^P(fln+i) * * * C^P(Om+i) = ^(^(fli? • • • , 



and P(there exists a vertex x with ||x|| > t) < 
rm+l\ /•(") 



( m -f 1 \ /* 

R" 



Next, we apply a transformation of coordinates as in [1]. We replace the vectors ai, . . . , by vectors 
6i,...,6n with the property, that b'- = h for some h with 0 < h < 1 for alH = l,...,n. The 
determinant of the Jacobian of this transformation can be taken into account by Xn-i{un) Det(R), 
where the matrix B consist of the rows (6j, . . . , 1) for i = 1, . . . , n and An-i(o;n) denotes the 

Lebesgue measure of Un- Furthermore, let bi := (6- , . . . , b^~^) for 2 = 1, . . . , n and f(bi) the density of 
Then we have \\x^\\ = l//i(ai, . . . , 0 ^) = l//i > 1, because after the transformation the distance 
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to the origin of the hyperplane H(ai , . . . , On) is simply h < 1 . So, we get 
P (there exists a vertex x with ||x|| > t) 

0 Rn-1 

Reorganizing the order of integration we can evaluate the multiple integral over with re- 

spect to the 6j’s. Therefore we use the function g{h), which is the density corresponding to G{h), 
i. e. g{h) = 



An 2{(^n i)^ _ um-RSM (compare [3]) 



An— l(^n) 



n Bemim) • • • ^ 

ji-. -/vn; 1 (2,r)’-ir(!lf2)r(!ii^ + i) ’ 

Substitution of this formula in ( 15 ) and ‘T ll < -^sl. delivers 

P (there exists a vertex x with l|a;|| > t) 

i/t 

< .. f G(/i)”*-"+' g{h){l - /i2)("-D(7-i)d/i 

\ n / \/n — 1 J 






( 16 ) 



The first inequality in ( 17 ) follows from g{h) < (1 - and approximating the remaining 

integral according to X? f{x)dx < {b- a) M 3 Xxe[a,b]ifi^)}- 

Now, we have an approximation for P(there exists a vertex x with ||a;|| > t) of the form: 

j • - • G f - j 

On the other hand the probability is smaller than 1 for all t, and therefore let i satisfy 

_ »-(”■:■)■)■«())’■"■=•■ _ (■«) 
Obviously, there exists a t > 1, which satisfies ( 18 ). We only have to choose i sufficiently large 
(depending on m, n), because 0.8n 7 * G(y)’^“”'^^ is greater than 1 for t = 1, monotonically 

decreasing for growing t and it tends to 0 for t -> 00. 

Using i we can specialize the upper bound on the probability under consideration to: 

1 for all (1 <)t < f, 

P(there exists a vertex x with ||x|| > t) < { zm a. 1 \ 1 /•lx m— n+1 

0.8n(”*^^)i.G(|) foralH>£ 

Note, that for simplification we use G Q) instead of G on the right side. 

This upper bound can be used to define a lower bound on Frit) via ( 13 ) and to define an approximate 
distribution function F in the same way: 



0 



Frit) > 



for all 1 < t < t. 



l-0.8n("^^^) i-G(i)"* " forallt>f, 
F satisfies F{t) > 0, P is monotonically increasing and limt_^c» F{t) = 1. 



=: F(t). 
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3.4 The Average Number of Iterations (Part 2) 

With this explicit formula for F, the calculation of Ep[Li{T{Xp))] is easy and we get an upper 
bound on the average number of iterations of Algorithm 2 by using E^[Li{T{Xp))], (11) and (12). 

S[|ii(T(Xi))|] < Ep[L,{T(X},))] = %[(m + 1) ln(l + r(Xi ))] 

= (m + 1) J ln(l + t)dF(t) = 0.8n(m + 1) (”^^ ^) G (|)"* j j 

= (m + 1) ^ln(l +f) + f In j j < (m + 1) (ln(l + 1) + f ln2) . (19) 

In order to get an upper bound on the average number of iterations which depends on the parameters 
m, n only, we still have to find the explicit value of f, resp. an upper bound for some t with 

Any upper bound on a i satisfying (20) (and such t itself) delivers an upper bound on i. 

Using simple, but somewhat technical approximations of the left-hand side in (20) one can show for 
n > 3 and m > 5n that i := 20\/n - 1 satisfies (20). 

If we take into account that (by Theorem 11) Algorithm 2 terminates after at most 0(|Li(T(Xf))|) 
iterations and if we use the above value of f, then we have established the subsequent theorem: 

Theorem 14 For n > 3, m > 5n the average number of steps of Algorithm 2 is at most 0(my/n) 
under the uni-RSM. 
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Towards a Unified Framework for Randomized Pivoting 
Algorithms in Linear Programming 



L. Finschi, Eidgenossische Technische Hochschule Zurich 
K. Fukuda, Eidgenossische Technische Hochschule Zurich 
H.-J. Liithi, Eidgenossische Technische Hochschule Zurich 

Summary: We study linear programming (LP) algorithms. Of particular interest are bounds 

for the number of elementary arithmetic operations necessary to solve a linear program. The best 
bounds that depend only on the sizes of a basis and a nonbasis have been found for a family of 
randomized pivoting algorithms. However, the original descriptions and analyses of these algorithms 
use several different geometric and abstract settings. In this paper we present a unified framework in 
which we describe two known algorithms as special simplex methods and analyse their complexities 
and differences. 

1. Introduction 

A linear programming (LP) problem is to find a maximizer (or minimizer) of a linear function over a 
system of linear inequalities. We study LP algorithms and upper bounds for the number of elementary 
arithmetic operations necessary to solve LP problems; for this we assume that each operation can 
be performed in constant time. In particular we are interested in bounds that depend only on the 
size m of a basis and the size n of a nonbasis. To find an LP algorithm that is polynomial in m and 
n — usually this is called strongly polynomial — is one of the most challenging open questions. 

The simplex method of Dantzig [2] and also the algorithms of Kachiyan [9] and Karmarkar [8] do not 
satisfy this condition. New algorithms with complexity bounds in terms of m and n were introduced 
by Megiddo [11], Seidel [13], and others, but the dependence on n remained exponential. In 1992 
Sharir and Welzl [14] and also Kalai [6] presented randomized algorithms which turned out — after 
proofs first of Kalai and then of Matousek, Sharir, and Welzl [10] — to have a socalled subexponential 
expected running time. The proofs rely on similar ideas, whereas the descriptions of these algorithms 
use very different frameworks. 

Our main motivation is to understand the Sharir- Welzl algorithm (here abbreviated by MSW) and 
Kalai ’s algorithm as pivoting algorithms so that we can consider them as refinements of the simplex 
method and compare them with any other pivoting algorithm on the same ground. In 1995 Gold- 
wasser [5] showed that MSW and some variant of Kalai’s algorithms are dual to each other. Kalai 
[7] claimed in 1997 that another variant of his algorithms and MSW are equivalent in a dual setting. 
This paper compares these last two algorithms in our framework and shows that their behaviors 
are slightly different. The main importance is not the diffence we found but the simplicity and the 
preciseness of our framework that permit rigorous analysis and even straightforward implementations. 

2. The Framework: Dictionaries and Oracles 

For a rigorous treatment, we present in this section the definitions and notations which we use for the 
description of the algorithms in the following sections. The reader who is familiar with dictionaries 
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and pivot operations may directly go to Section 3. 

2.1 Dictionaries and Pivot Operation 

For two finite and nonempty sets R and C a.n Rx C -matrix M consists of the entries Mij for i 6 R, 
j £ C. For nonempty subsets I C R^ J C C we denote the submatrix corresponding to the entries 
Mij for ie I, j e J by Mjj. We set M/ := M/c, M.j := Mrj, and Mj := etc. 

We assume without loss of generality that an LP problem is given in the following dictionary form: 

maxxf subject to Xg = Dxjf, Xg = \,xe> 0, (1) 

where D is a given rational B x TV-mat rix for index sets 5 = B \J {f] and TV = TV U {p}, and 
£■ = 5 U TV; X is a variable vector, where E B U TV. D = D(B) is called the dictionary, B and 
TV the corresponding basis and nonbasis, respectively; we set m := |B| and n := |TV|. 

We consider a linear program P as given by (1). For i e B, j e N holds: We can find a dictionary 
form of P with basis B\{i}u {j} if and only if Dij ^ 0. The replacement of B by 5 \ {i} U {j} for 
Dij ^ 0 is called the pivot operation on {i,j). The pair (i,j) is called the pivot The new dictionary 
D = D{B \ {z} U {j}) after a pivot operation on (i,j) can be computed in time 0(mn). 

We remark that the dictionary form of the dual linear program is 

max 2/^ subject ioyj^= -D'^y^^yf = l,yE> 0. 

Compared with the primal dictionary form (1), basis and nonbasis variables interchange, in particular 
/ and g interchange, and the dual dictionary of D is -D^. 

A rational E- vector x is called a basic solution (of P), if there exists a basis B of F such that xn = 0 
and Xg = 1 (and then x-^ = D.g). We call this x = x(B) the basic solution corresponding to B. In 
order to find the optimal solution we have only to visit the basic solutions: If there exists an optimal 
solution of P, then there also exists an optimal solution of P which is a basic solution. 

For a linear program P given as in (1) and for F C F the contraction problem of P with respect 
to R is defined as P with additional constraints xr = 0 and denoted by P/R. If F C TV we can 
write P/R in the form (1) by substituting D.n\r for the dictionary (i.e. we delete the columns F 
in the dictionary D). The value x/ of an optimal solution x of P/R is denoted by v{P/R)\ if P/R 
is infeasible we set v{P/R) := -oo, if P/R is unbounded we set v[P/R) := -|-oo. We will use the 
obvious extension of the linear order on the real numbers to — oo and +oo. 

Definition 1 (Properties of a Dictionary) A dictionary D = DiJB) is called feasible if Dbq > 0, 
inconsistent if exists i £ B such that Dig < 0 and DiN < 0, and degenerate if exists i £ B such that 
Dig = 0. We call D dual feasible, dual inconsistent, or dual degenerate if the dual dictionary is 
feasible, inconsistent, or degenerate, respectively. 

The dictionary is called optimal if it is feasible and dual feasible, unbounded if it is feasible and 
dual inconsistent, terminal if it is optimal or inconsistent or dual inconsistent, and it is called nearly 
optimal w.r.t j 6 TV if it is not optimal and if D.N\{j] is optimal. 

We call a linear program P nondegenerate if all feasible dictionaries are nondegenerate; there is an 
obvious dual notion of this definition. Remark the correspondences between a dictionary D(B) and 
the basic solution x{B). 
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Definition 2 (Simplex Pivot) Consider a feasible dictionary D. We call (i, j) a simplex pivot if 
Dfj > 0, and Dij < 0 and if after a pivot operation on {i,j) the new dictionary D is feasible. 

By a pivot operation on a simplex pivot (a simplex pivot operation) the objective value does not 
decrease {Dfg < D/g). If D is nondegenerate, the objective value increases [Dfg < D/g)- There 
exists a simplex pivot (i,j) in D if and only if D is not terminal. 

2.2 Oracles for Dictionaries 

We will use the following oracles which reflect properties of the sign structure of dictionaries: 

• The value of Terminal(D, R) is True if the dictionary D is terminal for the contraction 
problem P/R (i.e. if D.n\r is terminal), otherwise False. 

• The value of NearlyOptimal(D, /^) is (True, j) if the dictionary D.n\r is nearly optimal 
w.r.t. j e AT, otherwise (False, 0). 

• For a feasible and not terminal dictionary D and j £ N such that Dfj > 0, the value of 
Pivot (D,j) is a dictionary after an operation on a simplex pivot (i,j) in the given column 
(this simplex pivot is unique for nondegenerate linear programs). For input (D, j) that does 
not satisfy the above requirements the value of the oracle is the given dictionary D. 

In a usual implementation the running time of these oracles will be 0(mn), 0(m H- n), and 0{mn). 

3. The Algorithms 

3.1 Preparation: Simplex Algorithm 

The algorithms we will discuss in the following are variants of the primal simplex algorithm which 
is reviewed here in dictionary notation. We use the oracles of Subsection 2.2. 

Algorithm Simplex: 

Input: A feasible dictionary D, deflning a linear program P as in (1). 

Output: A terminal and feasible dictionary D* for P, i.e. D* is either optimal or unbounded. 

Simplex(D) 

begin 

while not Terminal(D, 0) do 

choose any pivot column j e N such that Dfj > 0\ 

D Pivot (D,j) 
end while; 
return D 

end. 



In the following algorithms the choice of the pivot column will be given by randomized rules based 
on the sign pattern of the dictionaries. 
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3.2 Algorithms and Main Results 

We present in this subsection an algorithm of Matousek, Sharir, Welzl [10] and an algorithm of Kalai 
[7] with the main complexity theorems. The proofs are given in Section 4. The two algorithms are 
called here MSW and Kalai and work with a feasible starting dictionary and for nondegenerate 
linear programs. 



We present MSW as a primal simplex algorithm (so dualized compared to the original algorithm): 

Algorithm MSW: 

Input: A feasible dictionary D defining a nondegenerate linear program P, a set P C N. 

Output: A terminal and feasible dictionary D* for P/R, i.e. D* is eithor optimal or unbounded for 
PIR. 

MSW(P,P) 

begin 

if R = N then return D 
else 

choose e e N\R ai random; 

D := MSW(D, Pu{e}); 

if Terminal(T), R) then return D 

else 

D := Pivot (£), e); 
return MSW(P, P) 
endif 
endif 

end. 



Theorem 3 (Analysis of MSW) The expected number of pivot operations that are generated by 
MSW(D,0j is at most ^ algorithm is finite and terminates with correct output. 



Here is our description of Kalai ’s algorithm: 

Algorithm Kalai: 

Input: A feasible dictionary D defining a nondegenerate linear program P, a set PC N. 

Output: A terminal and feasible dictionary D* for P/P, i.e. D* is either optimal or unbounded for 
P/R. 
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Kalai(D, R) 
begin 

if Terminal(D, R) then return D 
elseif \ i?| = 1 (with N\R = {e}) then 
D := Pivot (£), e); 
return D 

else 

(v,e) := NearlyOptimal(D, i?); 
if t; = True then 

choose e e (N\R)\ {e} at random; 
else choose e € N\R a.t random; 

endif; 

D := Kalai(A ^U{e}); 
return Kalai(£), R) 
endif 

end. 



Theorem 4 (Analysis of Kalai) The expected number of pivot operations that are generated by 
Kalai(D,0^ is for n > 0 at most e^V^m+nHr^ ^ i ^ ji = q there is no pivot operation generated. 
The algorithm is finite and terminates with correct output. 

Both algorithms are indeed variants of the simplex algorithm: Obviously there will be some sequence 
of simplex pivot operations, and the algorithms will return a dictionary which is terminal for P/R 
if they terminate (observe that in both algorithms the argument of any return statement either is 
a dictionary which is terminal for P/R or a recursive call with the same contraction set R). 

4. Discussion of the Algorithms 

4.1 Analysis of MSW 

We discuss in this subsection the algorithm of Matousek, Sharir, Welzl MSW. The analysis is the 
same as in the original paper [10], but instead of the notation of socalled LP-type optimization 
problems we use the setting of dictionaries (see Subsection 2.1) and oracles (see Subsection 2.2). 

We write v{R) instead of v{P/R). Given a linear program P, a nonbasis ]V of P, and RC E, then 
e e E is called active with respect to {N^R) if v{RU {e}) > v{N). The set of all active constraints 
is denoted by A(N,R), and its cardinality by a^^R := \A(N,R)\. This definition has the following 
relation to the corresponding definition of enforcing constraints from the original MSW paper [10]: 
e enforcing in (P, P) if and only if e 0 A{N,R). The following two monotonicity relations hold: 
5 D P => A{N,S) C A(7V,P), and v{N) > v{N) A(7V,P) C A{N,R). For P C AT we have 
N C A{N, R) and hence n < on^r. 
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Definition 5 {E{D, R) and i)) A call MSW(D, R) will generate a (finite or infinite) sequence 
of pivot operations, where a certain sequence is chosen according to the probability distribution 
implicitly defined by the algorithm. In the same way the length of such a sequence is probabilistic. 
We denote by E(D, R) the expected number of pivot operations generated by MSW(D, R). For k >0 
and £ > 0 let P{k,£) denote the smallest upper bound for E(D^ R) for all feasible dictionaries D that 
define a nondegenerate linear program and all RC N such that on^r <n-\-k and |iV \ J?| = i. 

Lemma 6 The function p holds: 

(i) For any k > 0.* /3{k, 0) = 0. 

(ii) For any i > 0; l3{0j£) = 0. 

min(ib,^) 

{u)Foranyk>lJ>l:fi{kJ)</3{kJ-l)-i-\ E {I + I3{k - j,£)). 

j=i 

Proof (Outline): 

(i) ^ = 0 implies R = N, so there will be no pivot: E{D, R) = 0 and hence /3{k, 0) = 0. 

(ii) D = D{B) is terminal for P/Ry since otherwise n < a^^R in contradiction to A: = 0. 

(iii) We order the elements in N \ R = {ei, . . . , e/} such that v{R U {ei}) < ... < v{R U {e/}). 
We choose with probability j an element e = ej e N H R. Then we compute recursively 
D = MSW(D, U {e}) with an expected number of E{DyRU {e}) pivot operations which 
is at most /3{k,£ — 1). If D is terminal there are no more pivot operations, otherwise we 
make one simplex pivot to D and recursively compute MSW(£), i?) which causes together 
the expected number of 1 + E{DyR) pivot operations. We show that E{DyR) < /3{k — j,£): 
Monotonicity of the simplex pivots implies v{N) < v(N) and hence A{NyR) C A{NyR)\ for 
z < j is e» € A(N, R) \ A{N, R); i.e. ^ < on^r — j < n-h k — j. n < implies j < A;, so 
we have to extend the sum of all possible cases not further than min(A;, £). 

□ 

Theorem 7 For any k>Q,e> 0; P(k,£) < '“(<+»). 

Proof: The inequality holds for any function j3 with the properties as in Lemma 6. For a proof see 
[10], Proposition 5 (the function Pmsw there is /? + 1). □ 

Proof of Theorem 3: Remark that aAr ,0 < m + n, hence the bound ^(m, n) is valid for any linear 
programs with m = |J5|, n = \N\ for a call of MSW(D,0). We prove the finiteness and correctness 
according to the recursion of the algorithm and use induction in A;+^. Let B{k,£) denote the maximal 
number of pivot operations for any linear program with k and £ as defined for j3{ky£). As in the 
proof of Lemma 6 we can show: For any A: > 0: B(k, 0) = 0; for any ^ > 0: B(0, £) = 0; for any 
A: > 1,^ > L B{ky£) < B{ky£ - 1) + B{k — 1,£) A 1. We easily prove B{k,£) < - 1. The 

correctness of the algorithm is now trivial. □ 
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4.2 Analysis of Kalai 

In 1992 Kalai [6] presented three algorithms 5o,5i,52; Goldwasser [5] showed in 1995 that a certain 
variant of Sq is dual to MSW. In an article from 1997 Kalai [7] describes two new variants Algorithm 
I and Algorithm II and claims (without proof) that Algorithm I “is equivalent (in a dual-setting) 
to the Sharir-Welzl algorithm” of [14] which is exactly the same as the MSW algorithm of [10]. 
Kalai’s Algorithm I is the origin of the algorithm Kalai which is investigated in this subsection. 
We will compare Kalai with MSW in Subsection 4.3 with the result that the two algorithms are 
slightly different. 

Kalai’s Algorithm I is not a complete algorithm. One difficulty is that Kalai describes some 
algorithmic steps without giving the order of their execution in the algorithm. Another weakness of 
the description is the missing definition of the trivial case of the recursion. Above all, there is no 
pivot operation explicitly declared. We tried to solve these problems in a most natural way. This 
leads to the form of Kalai. We remark that also for some other forms of Kalai’s algorithm the results 
of Subsection 4.3 hold. 

We write u(i?) instead of v{P/R). Given a linear program P, a nonbasis N of P, and RC E, then 
e £ E is called strongly active with respect to (TV, R) if v{R U {e}) > v{N) and e ^ R. The set of all 
strongly active constraints is denoted by A{N^ R) and its cardinality by := |A(TV, P)|. We remark 
that the above definition is the same as the definition of an active constraint in Kalai’s paper [7]. 
The monotonicity relations 5 D P => A(iV, S) C A(AT, R) and v(N) > v(N) A{N, R) C A(iV, R) 
hold. 

Definition 8 {E{D, R) and P{k, £)) We denote by E{D, R) the expected number of pivot operations 
generated by Kalai(D, P) (analogous to E(D,R) of Definition 5). For A: > 0 and ^ > 1 let P{k,i) 
denote the smallest upper bound for E(D, R) for all feasible dictionaries D that define a nondegenerate 
linear program and all P C A such that d^^R < k and |A^ \ P[ = £. 

Lemma 9 The function $ holds: 

(i) For any k >1: P{k, 1) < 1. 

(ii) For any i > 1.* = 0. 

min(A:,/— 1) ^ 

{\\\) For any k>\,£>2: p{k,£) <p{k-l,£-\)-\- X) /3{k—j,£). 

j=i 

Proof (Outline): 

(i) Either D is terminal and then there is no pivot operation, or we have a problem of dimension 
1 which will be solved with one simplex pivot: E{D,R) < 1 and hence P{k, 1) < 1. 

(ii) A; = 0 implies on^r = 0, and then D = D(P) is terminal for P (since otherwise 0 < apf^R). 

(hi) We order the elements in TV \ P = {ei, . . . , e/} such that v{R U {ci}) < . . . < v{R U {ci]). 
If D is terminal for P then there is no pivot operation. Since £ > 2 the case |TV \ P| = 1 
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is not possible. If D is not terminal for R then we have to distinguish two cases: When the 
dictionary D is nearly optimal with redundant constraint e € iV, then e = ei and we choose 
with probability an element e = ej for some j € {2,...,^}; recursion leads (similarly to 

the analysis of MSW) to the inequality P{k^ i) < p{k — 1,^-1) + ^ E — j, ^)- In 

i=i 

the case that D is not nearly optimal we choose with probability j an element e = ej for some 

min(fc,^) _ 

j 6 {1, . . . , i}, and we will obtain the inequality 0{k, i) < p{k - 1, £ - 1) + | £ /3{k - j, £). 

j=i 

Since according to the definition of P foi i < j holds P{k — j^i) < P{k — i,£), the right hand 
side of the first recursion inequality dominates the right hand side of the second. 

□ 

Theorem 10 For any k > 0,i > 1: p{k,i) < -h 1. 

Proof; Kalai [7] does not give a proof but the reference to MSW [10] . In fact we can use Theorem 7 as 
follows: Define P(k, i) := P{k^ f + 1) — 1, and let p be the function which holds Lemma 6 everywhere 
with equality. Then Lemma 9 implies p{k,i) < P{k,i), so by Theorem 7 for any A: > 0,^ > 0: 
P{k^ £) < This is equivalent to the claim. □ 

Proof of Theorem 4; Remark that < m + n, hence the bound /?(m + n, n) is valid for any 
linear programs with m = |B|, n = \N\ for a call of Kalai(D, 0). We prove the finiteness and 
correctness according to the recursion of the algorithm and use induction in + Let B{k,i) denote 
the maximal number of pivot operations for any linear program with k and i as defined for P{k,i). 
As in the proof of Lemma 9 we can show: For any A; > 1: B{k, 1) < 1; for any £ > 1: B{0,£) = 0; 
for any A: > 1,^ > 2: B{k,£) < B{k — 1,£ — 1) B{k — 1,£). We easily prove B{k,£) < (JlJ) for 
A:>1,^>1. The correctness of the algorithm is now trivial. □ 

4.3 Comparison of Algorithms 

We give a definition for the equivalence of algorithms and two examples for the comparison of the 
two algorithms MSW and Kalai that have been presented in the previous two subsections. We 
show that these algorithms are not equivalent in our sense. 

Definition 11 (Probabilistic Pivot Sequence Distribution V{D,R)) Consider a randomized 
pivoting algorithm A(D, R) accepting as input a dictionary D and a set of indices R (e.g. as in 
MSW). After the call of A(D, R) the algorithm will generate a (probabilistic) pivot sequence, i.e. sub- 
ject to the randomization of the algorithm a sequence of pivot operations on pivots (ii, ji), { 12 ^) 2 ) ^ • • •• 
If the algorithm A with input D and R always terminates, all possible pivot sequences are finite and 
will occur with a certain probability: the corresponding probability distribution defined on the set 
of all finite pivot sequences for a call of A(D, R) is denoted by V{D, R). 

Definition 12 (Equivalence of Algorithms) Two randomized pivoting algorithms A 1(D,R) and 
A2(D, R), both accepting the same input (D, R) where D is a dictionary and R some set of indices, 
are called equivalent, if they have for every accepted input (D, R) the same probabilistic pivot se- 
quence distribution V{D,R). 
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Example 1. The first example considered is given in dictionary form (1) by the dictionary 



0 


-1 


3 


-1 


1 


2 


-1 


-1 


to 


1 


-1 


0 




-1 


0 


0 



and N = {1,2,3} and B = {4,5,6}. The probabilistic pivot sequence distributions of MSW and 
Kalai are as follows: 



Pivot Sequence 


MSW 


Kalai 


(4, 2), (5,1), (6, 4) 


Prob. =1 


Prob. = I 


(4, 2), (5,1), (6, 3), (3, 4) 


Prob. =i 


Prob. = i 



Hence the two algorithms are not equivalent. The expected number of pivot operations is 3| for 
MSW and 3| for Kalai: For Example 1 Kalai is (slightly) “faster” than MSW. This is not true 
in general, as the following example shows: 

Example 2. The second linear program is very similar to the first, we have to modify not more 
than one entry in the initial dictionary which is for Example 2 



0 


-1 


3 


-1 


1 


2 


-1 


-1 


to 


1 


-1 


1 

2 




-1 


0 


0 



The probabilistic pivot sequence distributions of MSW and Kalai for this second example are 



Pivot Sequence 


MSW 


Kalai 


(4, 2), (5,1), (6, 3) 


Prob. =1 


Prob. = i 


(4, 2), (5,1), (6, 4), (4, 3) 


Prob. =1 


Prob. = 1 



The expected number of pivot operations is 3| for MSW and 3| for Kalai. 

5. Final Remarks 

We have presented in one framework two randomized pivoting algorithms and compared them in 
order to show that they are not the same. The original discussions used very different settings 
which made comparison difficult. Our precise analyses may help us investigate where we possibly 
overestimate the real complexity. One goal is an extension of the framework to analyse a more general 
class of algorithms including primal-dual algorithms such as criss-cross methods [4]. 

Our framework is combinatorial in the sense that it uses only the signed incidence relations of the 
input data and not quantitative information: We can interprete the problems and algorithms as if 
they would be in a setting of oriented matroids instead of linear programming (see e.g. [1]). We 
even can deal with assumptions as (initial) feasibility and nondegeneracy of LP problems in the 
same combinatorial way. The techniques we may use here are e.g. symbolic bounding techniques 
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and combinatorial perturbation. We can translate the complexity results of Subsection 3.2 to related 
results for the overall complexity when this techniques are used; for a detailed discussion see e.g. [3]. 
All this makes us hope that we can develop a unifying framework for a rich class of (randomized) 
pivoting algorithms which then will be an excellent basis for further investigation of most challenging 
questions on the complexity of linear programming. 
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The Constrained Crossing Minimization Problem 

A First Approach 

Petra Mutzel and Thomas Ziegler 
Max-Planck-Institut fiir Informatik, Saarbriicken 

Summary: We investigate the constrained crossing minimization problem for graphs defined as 

follows. Given a connected, planar graph G = {V,E), 3 . combinatorial embedding n(G) of G, and a 
set of pairwise distinct edges F C V x V, find a drawing of G' = {V,EUF) in the plane such that 
the combinatorial embedding 11(G) of G is preserved and the number of edge crossings is minimum. 
This problem arises in the context of automatic graph drawing. 

We show that the constrained crossing minimization problem can be formulated as a shortest crossing 
walks problem which is basically an |F|-pairs shortest walks problem in which the number of crossings 
between the walks is added to the cost function. This allows a combinatorial approach to the 
problem. Furthermore, we suggest a new algorithm for getting an approximate solution for the 
constrained crossing minimization problem. The branch-and-cut algorithm is based on an integer 
linear programming formulation for the shortest crossing paths problem. Our computational results 
on a benchmark set of graphs indicate that our approach may be promising for the future. 

1. Introduction 

Automatic graph drawing aims on generating pleasant layouts for given graphs. Objective aesthetic 
criteria are, e.g., a small number of edge crossings, evenly distributed vertices, relatively short edges, 
few edge bends, and a small layout area. The crossing minimization criterion is among the most 
important ones. Unfortunately, the problem of minimizing the number of crossings in a drawing 
is NP-hard (see [4]) and so far no practically efficient algorithm exists even for small nontrivial 
instances. 

The graph drawing method using planarization tries to satisfy all of the above aesthetic criteria. In 
particular, it is the best method for drawing entity-relationship diagrams (ER-diagrams) or PERT 
diagrams, that appear quite frequently in graph drawing applications (see, e.g., [3, 1]). The method 
using planarization proceeds as follows: 

(1) Compute a maximum planar subgraph P of the given graph G. 

(2) Determine a combinatorial embedding II (P) of the planar subgraph P. 

(3) Reinsert the removed edge set F into the fixed combinatorial embedding I1(P) of P such that 
the number of crossings is minimized. 

(4) Replace the crossings by new artificial vertices. 

(5) Draw the resulting planar graph using any planar graph drawing algorithm. 

(6) Finally, replace the artificial vertices by crossings again. 

This method does not lead to a drawing with the minimum number of crossings. However, it is a 
relatively good approach for getting drawings with a small number of crossings. The critical steps 
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are steps (1) and (3). Here, NP-hard combinatorial optimization problems arise. The other steps 
can be solved in polynomial time. 

Deleting the minimum number of edges that leave a planar graph (the unweighted maximum planar 
subgraph problem) is NP-hard ([9]). However, in practice this problem can be solved efficiently to 
optimality (or at least, close to optimality) using polyhedral combinatorics [6]. A commonly used 
heuristic method for step (1) is to compute a maximal planar subgraph of G instead. This can be 
done in polynomial time (see, e.g., [2, 5]). Examples show that solving the maximum planar subgraph 
problem exactly yields to much better drawings than using heuristics. 

In this paper we investigate the problem arising in step (3), the reinsertion of the edges into the 
planar subgraph P preserving the combinatorial embedding I1(P). Formally, this problem which we 
call the constrained crossing number problem, is defined as follows: We are given a connected, planar 
graph G = (V, P), a combinatorial planar embedding II(G) of G, and a set of pairwise distinct edges 
F C V X V and search for a drawing of G' = (V,E\J F) in the plane such that the combinatorial 
embedding 11 (G) of G is preserved and the number of edge crossings is minimum. The constrained 
crossing minimization problem is NP-hard [10]. 

In practice this problem has been attacked so far via iterative heuristics using the following observa- 
tion. If the cardinality of the given edge set F is one, the problem can be solved in polynomial time 
via computing a shortest path in the dual graph extended by some vertices and edges. The heuristics 
iteratively reinsert one edge after the other using the dual graph approach. However, the result is 
not always acceptable. We think that the exact solution of the constrained crossing minimization 
problem will lead to much nicer drawings. 

This is the first paper which studies the problem more carefully. However, so far we cannot give a 
practically efficient algorithm which is able to solve any (even a small) instance of the constrained 
crossing minimization to provable optimality. But our studies have given new insights into the 
problem and we are confident that further studies may come up with such an algorithm. 

The paper is organized as follows. In section 2 we state the graph theoretical definitions used in 
this paper. In section 3 we define the shortest crossing walks problem and show equivalence to the 
constrained crossing minimization problem. The shortest crossing walks problem is essentially a |F|- 
pairs shortest walks problem, in which the objective function is extended by the number of crossings 
between the walks. The formulation of the constrained crossing minimization problem as a shortest 
crossing walks problem is of a rather combinatorial than a geometrical nature. However, so far we 
did not find a “nice” integer linear programming formulation for the problem. If we restrict the 
|F| pairs of walks to simple paths instead, we can formulate the new problem as an integer linear 
program (see section 4) which can be attacked via a branch-and-cut algorithm. Unfortunately, in 
general this approach will not give the optimal solution to the constrained crossing minimization 
problem anymore. However, our computational results on a benchmark set of graphs have shown 
that the solutions are better than those obtained by previous heuristics (see section 5). In section 6 
we summarize our results and state an open problem. 
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2. Mathematical Preliminaries 

We consider undirected graphs G = (V^E). A graph not containing loops and multiple edges is called 
a simple graph. We call a graph weighted^ if a nonnegative value is associated with each edge of the 
graph. 

A vq — vi walk in G is an alternating sequence of vertices and edges of G, (uq, Ci, Ui, . . . , e/, vi), 

beginning and ending with a vertex, in which each edge is incident with the two vertices immediately 
preceding and following it. If the vertices vq^vi, . . . ,vi are distinct, then the walk is called a path. 
We define the length of a walk in a weighted graph as the sum of the weights on its edges. 

A drawing of a graph G is a mapping of G into the plane. The vertices go into distinct points, called 
nodes. An edge and its incident vertices map into a Jordan arc connecting the nodes corresponding 
to its end vertices and not containing any other node. A common point of two non-adjacent arcs 
is a crossing. An optimal drawing of G is one with the least possible number of crossings. This 
least number of crossings is called the crossing number cr{G) of G. A planar graph is a graph with 
cr(G) = 0. 

A drawing of a planar graph without crossings is called a planar embedding of G. The regions of 
a planar embedding are called faces. Note that one face is unbounded; it is called the outer face. 
A planar embedding of a planar graph induces a cyclic ordering on the edges incident to any fixed 
vertex, namely the counterclockwise ordering of the edges around their common endpoint. Such 
a cyclic ordering on the edges incident to each vertex is called a combinatorial embedding 11(G). 
Different planar embeddings of a planar graph corresponding to the same combinatorial embedding 
are equivalent up to the selection of the outer face. Given a planar graph G = (V, £■), a combinatorial 
embedding 11(G), and a combinatorial embedding n(G'), where G' is the graph G extended by some 
edge set F. We say that the combinatorial embedding n(G') is embedding preserving with respect 
to n(G) iff the cyclic ordering given in Il(G') on the edges restricted to E (incident to each vertex) 
is the same as in 11 (G). 

The geometric dual graph G* = (V*, E*) of a planar graph G = (V, E) with embedding 11(G) is 
defined by the following construction. A vertex v* is placed in each face fi of 11(G); these are the 
vertices of G*. For each edge e of G we introduce an edge e* which crosses e (but no other edge of 
G) and connects the two vertices v* which correspond to the faces incident to e. G* has a loop if and 
only if G has a bridge and G* has multiple edges if and only if two faces of n(G) have at least two 
edges in common. Thus G* is not necessarily a simple graph even if G is simple. Note that II(G*) 
of G* is uniquely determined by 11(G). 



3. Formulation as a Shortest Crossing Walks Problem 



In this section we will define the shortest crossing walks problem and show the connections to the 
constrained crossing minimization problem. The idea is to generalize the approach used for \F\ = 1. 
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Figure 1: Simple and distributed crossings between walks. 



3.1 The Shortest Crossing Walks Problem 

The shortest crossing walks problem is essentially a |F|-pairs shortest walks problem, in which the 
objective function counts the lengths of the walks and the number of crossings between them. We 
first like to have a combinatorial notion of crossings between walks. 

Let G = (V, E) be a weighted planar graph with embedding 11(G) and let pi = {vq, ei, Ui, . . . , vi-i,ei, vi) 
and P 2 = (uo, /i, ui, . . . , Um-i, fm, Um) be two walks in G. Let p '2 be the reversal walk of p 2 , that 
is, p '2 = fm^ Um-i, • • • , /i, uq). We have a crossing between p\ and p 2 in the following cases. 

Either there is a proper subsequence (e, u, e') in pi and a proper subsequence (/, v, /') in p 2 with 
all edges e, e',/, /' distinct and the order of the edges around v in 11(G) is e, /, e', /' or e, 

We call this a simple crossing. Or there is a common subsequence of p\ and p 2 or p\ and p' 2 , 
{vs,es+\, . . . ,euVt) = (u 5 /,/s/+i, . . . ,/t/,Ut/), such that the order of the edges around Vs in n(G) 
is es+\,es,fs> and around Vt is Cf, et+i, /t'+i or the order around Vs is es+i,fs',es and around vt is 
et, ft>+i,et+i, where Cs+i,es,fs> and ee,et-\-i, ft'+i are pairwise distinct. This is called a distributed 
crossing. A crossing of a walk with itself is defined similarly. Figure 1 illustrates simple and dis- 
tributed crossings between two walks. 

The number of crossings of a set of walks pi, . . . ,pib is the sum of all crossings between pi and pj for 
1 £ L j < ^ (* 3 3-re not necessarily distinct) and the value of a set of walks is the sum of the 

lengths of the walks plus the number of crossings. 

We define the shortest crossing walks problem as follows. Given a weighted planar graph G = {V,E,c) 
with weights Ce > 0 on the edges, a combinatorial embedding 11(G), and a set F of distinct pairs of 
vertices of G, find a set of walks with the following properties. There is one Si-U walk for every pair 
of vertices (si,ti) e F, z = 1, . . . , |F|, and no walk uses one of the vertices contained in a pair of F 
as an inner vertex. We call a set of walks with this property feasible for the shortest crossing walks 
problem (G, II (G), F). Among all feasible sets we want to find one with minimum value. Note that 
in an optimal solution of the shortest crossing walks problem no walk crosses itself. 

3.2 Equivalent Instances of the two Problems 

Given an instance (G, 11(G), F) of the constrained crossing minimization problem we compute an 
instance (G*, I1(G*), F*) of the shortest crossing walks problem in the so-called extended dual graph 
as follows. 

We compute the geometric dual graph G* of G with respect to II(G). Then for every vertex of G 
that is an end vertex of an edge in F we place an extra vertex, called supervertex, in the appropriate 
face /* of G* . We connect every supervertex by edges, called superedges, to all the vertices on the 
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Figure 2: A graph and its extended dual graph. 

boundary of /* so that there are superedges between every successive pair of edges in the list 11(G) 
of V. Note that we have parallel superedges in G* if t; is a cutvertex of G. The resulting graph G* is 
still planar and II (G*) is uniquely determined by II (G). The set F* is the set of pairs of supervertices 
corresponding to the end vertices of the edges in F. 

See Figure 2 for an example of a graph G and its extended dual graph G*. The graph G is the one 
induced by the solid edges in Figure 2(a). Figure 2(b) shows G with its extended dual graph. The 
vertices of the dual graph of G are drawn as rectangles. The set F is shown with dotted lines in 
Figure 2(a). Hence the supervertices are 1,3,4, 5,6 and 7. The superedges are drawn with dotted 
lines. In order to insert the edge (3, 7) in G, we can use, e.g., the direct path from 3 to 7 via face a. 
In G* the edge insertion corresponds to a path using only superedges. Another possibility would be 
inserting (3, 7) via face g, crossing two borderlines of face g before joining vertex 7. This time, we 
have used two regular edges in G* which are responsible for two newly introduced crossings. 

We associate the following weights with the edges of G*. Superedges get weight 0 and all the other 
(regular) edges get weight 1. Note that we allow superedges only at the beginning or end of walks. 
We show that these two instances are equivalent, that is, given an optimal solution for the shortest 
crossing walks problem in the extended dual graph, we can compute an optimal solution for the 
corresponding constrained crossing minimization problem in polynomial time and vice versa. 

Theorem 1 An optimal solution for the constrained crossing minimization problem on (G, H(G), F) 
is equivalent to an optimal solution for the shortest crossing walks problem on (G^, II(G*), F*). 

Proof: We first show that for any optimal solution of the constrained crossing minimization problem 
with z crossings in G we can compute a feasible set of walks for the corresponding shortest crossing 
walks problem with value z in G* in polynomial time. 

We construct a feasible set of walks for the shortest crossing walks problem by constructing a corre- 
sponding walk in G* for every edge in F. Let e € F be an edge of F between the vertices s and t. 
Given an optimal solution for the constrained crossing minimization problem e defines, in a natural 
way, a sequence of faces and edges (/i, ei, /2, C2, . . . , /m-i, em-i, fm) starting with a face /i incident 
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to s and ending with a face /m incident with t. The e* are the edges crossed by e and fi, fi+i are 
the faces incident to e* for 2 = 1, ... ,m — 1. This sequence can be transformed directly to a walk 
between the supervertices corresponding to s and t in G*. 

Every crossing of an edge in F with an edge of G corresponds to using an edge with weight 1 in 
the corresponding walk in G*, so the length of every walk is equal to the number of crossings of the 
corresponding edge with edges of G. 

Crossings between edges in F correspond directly to crossings between walks. Let ei and 62 be 
two edges of F that cross. Note that edges of F cross at most once in an optimal solution for 
the constrained crossing minimization problem. The crossing between ei and 62 happens in a face 
/. When moving along the border of /, we will meet the corresponding parts of the walks al- 
ternatingly. If the corresponding parts of the walks use distinct edges to traverse F, we have a 
simple crossing in the shortest crossing walks solution. Otherwise consider the common subsequence 
(fk.Ck, fk+ii^k+iy • ‘ fi) of fhe sequences corresponding to ei and 62 which contains the 

face with the crossing. The order of the corresponding common points of ci and 62 with the boundary 
of fk respectively fi must be the same and we have a distributed crossing in the shortest crossing 
walks solution. Besides these crossings there are no other crossings between walks in the shortest 
crossing walks solution. So the value of the set of walks is exactly >2;. 

Next, we prove that for any solution of the shortest crossing walks problem with value 2: we can 
compute a solution for constrained crossing minimization with z crossings in polynomial time. 

This can be proven similarly. Every walk defines a sequence of faces and crossed edges in the primal 
graph and crossings between walks can be transformed to crossings of the corresponding edges of F. 
We only have to be careful when choosing the faces in which the crossings take place. □ 

3.3 Surprising Observations 

So far, we have translated the geometric constrained crossing minimization problem into a new 
problem of rather combinatorial nature. We think that this may be helpful for attacking the problem 
in practice. One approach could be to formulate the shortest crossing walks problem as an integer 
linear program. So far we have not succeeded in that. In this section we will discuss some observations 
we have made using the shortest crossing walks formulation for the constrained crossing minimization 
problem. The observations lead to problems when searching for an ILP-formulation for the shortest 
crossing walks problem. 

Observation 1: Pairs of edges may cross more than once 

In (general) crossing minimum drawings, every pair of edges crosses at most once. Here, this is no 
more true. In an optimal solution for the constrained crossing minimization problem it can happen 
that an edge e £ F crosses an edge f £ E several times. In the extended dual graph this corresponds 
to using the corresponding edge f* several times in a shortest crossing walks solution. 

This leads to the problem that ( 0 , 1 )- variables are no more sufficient in an ILP-formulation for the 
shortest crossing walks problem. 

Observation 2: Shortest Crossing Paths are not sufficient 

Our first conjecture, that it may be sufficient to restrict the shortest crossing walks problem to simple 
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paths is, unfortunately, not true. Figure 2 shows an example in which a solution when restricted 
to simple paths gives a value of three, whereas the optimal solution of the shortest crossing walks 
problem (and hence also of the constrained crossing minimization problem) is two. Let G and U{G) 
be as shown in Figure 2(a) (solid lines) and F = {(3, 7), (1, 4), (1, 5), (1, 6)}. The extended dual graph 
G* is shown in (b). The optimal solution to the constrained crossing minimization problem is shown 
in (a) with dotted lines. However, in G*, the walk w corresponding to the edge (3, 7) is not a simple 
path, since it traverses the face corresponding to the vertex a in G* two times (see Figure 2(b)). 
More formally, w traverses the vertices (3, a, p,a, 7), where the path from a to g crosses the edge 
(0, 1) in G and the path from p to a crosses (1,2). An optimal solution of the shortest crossing walks 
problem in which we restrict the solution to a set of simple paths, is the one shown in Figure 2(a) 
where the path for edge (3,7) is substituted by a path from 3 to 7 crossing the edges (1,4), (1,5), 
and (1,6). 

Observation 2 leads to a serious problem, namely that crossings between a pair of walks can no more 
be detected locally if we do not know the exact order of the edges on the walk. Hence it is hard to 
come up with inequalities modelling the number of crossings in a set of walks if every walk is only 
given by a set of edges. 

4. An ILP-Formulation for the Shortest Crossing Paths Problem 

In this section we present an ILP-formulation for the shortest crossing paths problem on the extended 
dual graph G* = (V*,E*) with embedding H(G*) and the set F* of pairs of supervertices. The 
difference to the shortest crossing walks problem defined above is that we only allow (simple) paths 
between the pairs in F instead of walks. In our computational experiments we use a branch-and-cut 
approach based on the ILP formulation described here. 

We call the elements of F* commodities. For commodity k = {sk,tk) 6 F* we define Fjf as the set 
of regular edges in G* plus the superedges incident to Sk or tk and introduce a variable for each 
edge e G Fjf . For every pair of commodities A;, I € F*, A; ^ and every regular vertex u we introduce 
a variable called the crossing variable at u for commodities k and /. 

In the description of the integer linear program we use the following notation: The weight of an edge 
e is denoted by Cg (recall, that Cg = 0, if e is a superedge, and Cg = 1, otherwise). The set of regular 
vertices in V* is denoted by V^. 6{v) denotes the set of edges incident to u 6 V^*. Let E' be a subset 
of E* and k G F*. By x^{E') we denote the sum of all Xg with e G (F' fl Fjf). Let s,t eV* be two 
distinct vertices. An (s,t)-cut in G* is a set of edges C C E* such that there is no path between s 
and t in G \ G. 

With this notations the integer linear program looks as follows. 

min X) XI + E E 

keF* e£El uev* k^leF* 

subject to 



^kJeF*,k^l,eeEl,ueV^ 

VA; G F* and u end vertex of commodity k 



Xe,zt‘ e {0.1} 

x*(J(w)) = 1 



( 1 ) 

( 2 ) 
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\ e) - X* > 0 Vw e V2, Jk e F*, e 6 n 5(u) (3) 

x*(C) > 1 VC (s, <)-cut of Gk = (V*, El),k={s,t)^F* (4) 

2x*' > 2(x* + + X* + x^) - 2 - x*(5(«)) - x‘((5(u)) (5) 

Vu e V^,k,le F*,k:/:l,a,ce E*knS(u),b,de EtnS{u),a <b<c<din n(C*) 

2 E > 2(x* + x^ + X* + x^) - 2 - X) (^*(<5(w)) + a:'(5(w))) + ^^(x* + x') (6) 

weP weP eeP 

V P path between u^v e V^, u ^ v ,kj € F*, k I 

a,b,ce (J(u) and a € Fjf, b e Ef, c € FJ fl F* fl P, a < 6 < c in I1(G*), 

d,e,fe 6{v) and e e El, f G Ef, dGFjDFJ^nP, d<e< / in II(G*) 

Every feasible set of paths for the shortest crossing paths problem defines a feasible solution for this 
integer linear program as follows. For every A; G F"^ let = 1, if e is used by this path, and = 0, 
otherwise. For every pair k, I G F*, k ^ I, let = 1, if the paths for k and I have a simple crossing 
at u G V^. For every distributed crossing between the paths for k and I we choose one vertex u on 
the common subpath and set = 1. All not set to one above are set to zero. This solution 
satisfies all the constraints of the ILP and the value of the objective function is equal to the sum of 
the path lengths plus the number of crossings between paths. 

On the other hand every feasible solution of the ILP defines a feasible set of paths for the shortest 
crossing paths problem. The first four constraints (almost) restrict the solutions for every k e F* to 
paths. It is possible that the set of edges for a A; G F* contains cycles, but we can easily eliminate 
such cycles. 

Constraints (5) and (6) directly model simple and distributed crossings, respectively, between the 
paths for k, I G F*, k ^ I, that is, 2 : has to be one to satisfy the constraint, if there is a crossing. The 
value of the objective function is exactly the sum of the path lengths and the crossings between the 
paths. 

5. Computational Experiments 

We implemented a branch-and-cut algorithm based on the integer linear programming formulation 
presented in Section 4. The algorithm is not able to solve the constrained crossing minimization 
problem to provable optimality; however, our hope was to improve the solutions obtained by the 
currently used heuristics. The implementation uses ABACUS [8], a framework for implementing 
branch-and-cut algorithms. For an introduction to branch-and-cut algorithms, see [7]. The initial 
linear program contains the inequalities (1), (2) and (3). During the run of the algorithm only those 
inequalities of classes (4)- (6) are added that are violated at some solution x. The separation problems 
have been solved using minimum-cut algorithms and shortest path algorithms. For a more detailed 
description of the algorithm, see [10]. 

We compare our algorithm to the naive approach (just computing independent shortest paths and 
then counting the number of crossings) and the commonly used iterative heuristic for the constrained 
crossing minimization problem (see Section 1). 
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Figure 3: Computational Results 

We tested our programs on a set of 3715 graphs taken from a graph library of the University of Rome 
III used in [3]. The library contains 11582 graphs with 10 to 100 vertices. We only considered the 
nonplanar graphs with less than 60 vertices. We could compute maximum planar subgraphs for 3715 
out of the 3882 nonplanar graphs. For the maximum planar subgraphs we computed combinatorial 
embeddings. All experiments were run on a SPARC Ultra 2/2x200 with 1024 MB main memory 
under Solaris 2.5.1. 

Figure 3(a) shows the distribution of the graph sizes in our test set. A first surprising result is that 
for many graphs we only needed to delete one edge in order to get a planar graph. The number of 
deleted edges does never exceed nine. Figure 3(b) shows the distribution of the sizes of the sets F. 
Figure 3(c) shows the average number of crossings obtained when using the naive approach, the 
iterative heuristic and our new heuristic based on the shortest crossing paths problem. Since we have 
not been able to compute the exact optimal solution for all the graphs, we restricted the computation 
time to 10 minutes for |F| <= 7 and to 2 hours for |F| > 7 and took the best solution obtained within 
the time limit. (Unfortunately, we did not have the computational resources needed for increasing 
the time limit. This will change in the near future). Therefore, the figure also shows the lower 
bound obtained within the branch-and-cut algorithm. A lower bound to the constrained crossing 
minimization problem can be obtained via computing all the shortest paths and only counting the 
lengths of the paths. 

So far our obtained solutions did not improve much. However, we hope that they will improve further, 
once we are able to solve the shortest crossing paths problems to optimality. The lower bounds for 
the shortest crossing paths problem show that there is still room for improvement. Our guess is that 
we may improve further for the benchmark graphs with more than 60 vertices. 

6. Conclusions and Further Directions 

This is the first time that the edge re-insertion step for the planarization method is investigated. We 
have formulated the constrained crossing minimization problem as a shortest crossing walks problem 
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allowing a purely combinatorial approach for the originally geometrically formulated problem. Nat- 
urally, this leads to the question if there is a combinatorial algorithm (not necessarily polynomial) 
for solving or approximating the constrained crossing minimization problem. 

Furthermore, it is the first time that computational results provide lower and upper bounds for the 
constrained crossing minimization problem for instances from a benchmark set of graphs. 

Corresponding Author: Thomas Ziegler (tziegler®mpi-sb . mpg . de) , 

Max-Planck-Institut fiir Informatik, 

Im Stadtwald, 

D-66123 Saarbriicken (Germany) 
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Sensitivity Analysis of the Bipartite Weighted Matching 

Problem 



H. Achatz, University of Passau 

Summary: Let G be an undirected bipartite graph where a weight is assigned to every edge. A 

maximum weighted matching can be computed on G. We define slightly changed problems of the 
original matching problem. One type of problem is to delete a vertex, a pair of vertices or an edge 
from G. A second type of problem can be constructed if a vertex or a pair of vertices is doubled by 
making an identical copy. A third type of problem is the replacement of a vertex by a copy of another 
vertex. All these problems build up some kind of sensitivity analysis of the original problem and are 
of special interest for example in man power scheduling. We can show that all these problems can be 
solved simultaneously by finding the shortest path between all pairs of vertices in a graph G' which 
can easily be constructed. 

1. Introduction 

Let G = (V, E) be an undirected bipartite graph with n vertices. The set of vertices is partitioned 
into the sets X and Y. Therefore it holds that V = X UY and E C X x Y describes the possible 
edges. A weight w{x,y) > 0 is assigned to every edge [x,y] e E. A maximum weighted matching 
can be computed on the graph G. The set of matched edges is denoted by M. The overall weight 
of the maximum weighted matching M of G is MW{G) = W{M) = w(x^y) where MW{G) 

[x,y]eM 

denotes the maximum weight of matching in G and W (M) is the weight of a matching M. 




Figure 1: maximum weighted matching 

It is known [PS82] that in the weighted matching problem, we can do away with the graph G by 
adopting the convention that the underlying graph is always complete, and letting the weights of 
those edges that were missing in G be equal to zero. Because we are dealing with a bipartite graph 
we may assume that l^l = |y| - otherwise add new vertices with edges of weight zero. 

We may observe that the optimal solutions will always have perfect matchings. The bipartite weighted 
matching problem is also known as the assignment problem. The objective value of the weighted 
matching problem in Fig. 1 is 33. Because we only want to look at perfect matchings, there is a 
matched edge (1,6) with weight zero. Throughout the remaining paper \X\ = \Y\ and G is a complete 
bipartite graph. 

In this paper, we are interested in the solution of slightly changed problems. We can show that 
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Figure 2: Directed graph G' and distance matrix 

all these solutions can be achieved by solving a shortest path problem on a digraph constructed by 

means of a known maximum weighted matching. The problems are as follows: 

Gu For a vertex u in G let G^ denote the graph formed by deleting u from G. The problem is 
defined in [KLST97] and is called the all-cavity maximum matching problem which asks for a 
maximum weighted matching in G^ for all u in G 

Gx,y For a pair of vertices x e X and y e Y let Gx,y denote the graph formed by deleting x and 
y from G. The problem is again defined in [KLST97] and is called the two-vertex all-cavity 
maximum matching problem which asks for a maximum weighted matching in Gx,y for every 
pair of vertices x and y. 

G[x,y] For an edge [x, y] e E let G[x,y] denote the graph where the weight of [x, y] is set to zero. 
The problem asks for a maximum weighted matching in G[x,y] for every edge [x, y] € E. 

G^ For a vertex u in 6^ let G^ denote the graph formed by copying u and the adjacent edges G. 
The problem asks for a maximum weighted matching in for all u in G. 

G^^y For a pair of vertices x e X and y e Y let G®’*' denote the graph formed by copying both 
vertex x and vertex y and all adjacent edges in G. The problem asks for a maximum weighted 
matching in for every pair of vertices x E X and y eY. 

Gl For a pair of vertices u,v E X (or u, u € Y) let Gl denote the graph formed by deleting vertex 

u and copying vertex v and all adjacent edges in G. The problem asks for a maximum weighted 

matching in G^ for every pair of vertices u,v E X (u,v eY, respectively). 

2. Sensitivity analysis 

In order to do the sensitivity analysis we construct a directed graph G'. The set of vertices is the 

same as before. The arc set of G' is constructed as follows: 

- arc (x, y) with weight w{x, y), if [x, y] E M, x E X and y EY 

- arc (y, x) with weight — iy(x, y), if [x, y] E E\M, x EX and y eY 
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In Fig. 2 we show the graph G' constructed from the solution of the problem in Fig. 1 . We can show 
that our constructed graph G' contains no negative weight cycle. On the righthand side in Fig. 2 
there is the computed distance matrix. 

Lemma 1 G' contains no negative weight cycle. 

Proof: Suppose G' contains a negative weight cycle C. C is an alternating cycle with respect to M 

due to our construction of G'. Hence, C can transform the matching M to another matching 
Ml. It holds: 

W(M) = W(Mi)+ E 

{ij)eCnM (iJ)eC\M 

Because C is a negative weight cycle it holds that W{M) < W{Mi) and therefore M was not 
maximal. □ 

Let Ml and M2 be two matchings of G. Take all edges [x, y] 6 (Mi U M2) \ (Mi D M2). If all edges 
[x, y] from Mi are directed from X to Y with weight w{x,y) and all edges [x, y] from M2 are directed 
from y to X with weight —w{Xj y) then these arcs form a disjoint set of directed weighted paths and 
cycles. This follows directly from the matching property. These directed weighted paths and cycles 
are called the transformation set T(Mi,M2). TW(T(Mi, M2)) denotes the sum of all weights of 
paths and cycles in T(Mi,M2). The elements of the transformation set T(Mi,M2) can transform 
Ml into M2. It is obvious that the following holds: 

W{Mi) = W{M 2 ) + TW(T(Mi, M2)) 

If both matchings are perfect then T(Mi, M2) has only cycles as elements. 

Now, we apply a shortest-path-algorithm on G' which for every pair of vertices u and v computes the 
shortest distance d{u, v) with respect to the edge weights in G'. (Take for example the Floyd- Warshall 
algorithm described in [PS 82 ] which solves the problem in 0 {n^)) 

These distances give us some information on our sensitivity analysis. From this distance matrix we 
can read several results for slightly changed problems of the original matching problem. Notice that 
all the listed problems are solved with one solution for the all-pair-shortest-path-problem- 
We start with the analysis of the problem G[x,y]. If the edge [x, y] is unmatched then M is of course 
again a maximum weighted matching for G[x,y]> Hence, we only have to consider matched edges. 

Lemma 2 MW{G) - MW{G[x,y]) = min{w(x,y),d{x,x)}, [x,y] € M. 

Proof: We have to look at two different cases: 

Case 1 : d(x,x) < w(x,y) 

Let C describe the shortest path from x to x. C is an alternating cycle w.r.t M and therefore 
C can transform M into a matching M\. The net change of the weight is equal to d(x,x). 
The edge [x,y] is not contained in Mi and therefore Mi is a matching of G[x,y]- If there is a 
matching M2 of G[x,y] with W{M\) < W(M2) we are able to construct a cycle containing x 
with weight less than d{x,x). This is a contradiction. 




138 



Case 2: d{x^x) > w{x,y) 

Since every cycle containing x has at least weight w{x^ y) every perfect matching of G[x,y] is 
not better than the matching M \ {[x, 2 /]}. Every non-perfect matching of G[x,y] is not better 
than the best perfect matching of G[x,y]- D 

In [KLST97] this problem is formulated as an open problem. In our example we can see that deleting 
the matched edge [1, 6] does not change the value of the maximum weighted matching. If edge [2, 4] 
is deleted we loose d(2, 2) = 8 units. 

Remark 1 For [x, y] E M the following holds: d(x, x) = d{y, y) 

Next we look at the problem Gxy which is called the two- vertex all-cavity maximum weighted match- 
ing problem in [KLST97]. In [KLST97] this problem was solved by a different construction of G'. 

Lemma 3 MW{G) - MW{Gxy) = d{x, y), x e X,y e Y. 

Proof: Let P be the path from x e X to y eY with length d{x, y) in G'. P is an alternating path 
with respect to M. It starts and ends with a matched edge. Therefore, P can transform M into a 
matching Mi of G^y. Let M 2 be a maximum weighted matching of Gxy with W{M 2 ) > W{Mi). 
As defined before the set T(M, M 2 ) transforms M to M 2 . 

Case 1: An element of T(M, M 2 ) contains x and y. Because x and y are not in an edge of M 2 
they have to be start and end points of a path. Due to W(M 2 ) > W’(Mi) the weight of this 
path has to be less than d(x, y), which is a contradiction. 

Case 2: There are at least two elements in T, one element P\ contains x as start point and the 
other element P 2 contains y as end point. Because M is perfect Pi ends in a vertex yi e Y. 
The same is true for P 2 which has to start in a vertex xi e X. The weight of Pi and P 2 
in T(M, M 2 ) is at least d{x,y) (concatenate Pi and P 2 by adding edge [xi,yi]). This is a 
contradiction. Hence, Mi is a maximum weighted matching of Gxy. □ 

In [AM093] the opposite problem is handled. They solve the shortest path problem by invoking 
an assignment problem. In the underlying graph of the assignment problem the original vertices 
are copied to build a bipartite graph. Then, from each side one vertex x and y is deleted and the 
resulting assignment determines the lenght of the shortest path from x to y. 

In the problem G„ only one vertex is deleted. Because M was a perfect matching at least one other 
vertex can not be matched in G^. 

Hence, by using the values of Gxy we can compute Gx and Gy. 

MW{Gx) = mm{MW{Gxy)},x € X 
MW(Gy) = rmn{MW{G^)},y e K 

XcA 

Another type of problem can be constructed if a vertex is doubled by making a copy. This means 
that a new vertex v' is added to G with the same edge weights as v. The problem G®^ asks for the 
maximum weighted matching of a graph where both x e X and y G y are copied. If [x, y] is matched 
in M then of course M U {[x', y']} is a maximum weighted matching of G®^. 
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Lemma 4 MW{G) — MW{G^^) = min{—w{x,y)^d{y,x)}, x £ X,y € Y. 

Proof: Consider two cases 

Case 1: d{y^x) > —w{x^y). This means that [x,y] is matched in M. Let Mi denote the 
maximum weighted matching of G®*'. 

Case la: [x’,y’] is matched in Mi. Therefore, Mi = MU {[x',y’]} 

This means that MW{G) — MW{G^^) = —w{x^y). 

Case lb: [x',y'] is not matched in Mi. There is a directed path P from y' to x' in T(M, Mi) 
because both matchings are perfect. W{P) > d{y^x) 

Therefore, W{M) - l^(Mi) = Tl^(T(M,Mi)) > d{y,x) > -w{x,y) 

It follows that M U {[x', y']} is a better matching. Hence, [x', y'] is matched 

Case 2: d{y,x) < —w{x,y) 

Let P be the path from y to x with length d(y^ x) in G'. P is an alternating path with respect 
to M. It starts and ends with an unmatched edge if the starting point y and the endpoint x 
are replaced by the new vertices y' and x'. Therefore, P can transform M into a matching Mi 
of G^y. Let M 2 be a maximum weighted matching of with W(M 2 ) > W(Mi). There is 
a directed path Q from y' to x' in T{M, M 2 ) Because x’ and y' are not in an edge of M they 
have to be start and end points of Q. Due to W{M 2 ) > W{Mi) the weight of this path has to 
be less than d{y,x). Hence, MW{G) — MW(G^^) = d{y,x) and Mi is a maximum weighted 
matching of G®^. □ 

If only one vertex is copied we have to distinguish between vertices in X and in Y. 

Lemma 5 

MW{G) - MW{G^) = mm J0,d(u,x)}, x£X. 

MW{G) - MW(Gy) = - ""min \o,d{y,v)}, y£Y. 

v€Y,v^y 

Proof: We present only the proof for a vertex copy in Y. 

Case 1: d{y^ v) > 0 £ Y,v ^ y 

Every attempt to match the new vertex does not improve the existing matching. Hence, 
MW{G) - MW {Gy) = 0. 

Case 2: Let v £Y be the vertex where the minimum is achieved and d(y, v) < 0. The shortest 
path from y to v can transform M into a matching of G*' if the first edge on the path from y 
is matched to the new copy of y. Vertex v is now unmatched. If there is a matching M 2 of 
Gy which is better then there also exists a path from y to a vertex in Y which is shorter than 
d{y,v). This contradiction shows that the following holds: MW{G) — MW {Gy) = d{y,v) □ 

If a vertex Xi £ X replaces a vertex X 2 ^ X (vertex X 2 is deleted and vertex xi is doubled in the 
notation above) there exists again a perfect matching. These problems are denoted by G^l (problem 
Gll if 2/1 € y and j/2 € F). 
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Lemma 6 

MW{G) - MW{G%\) = d{x2,xi), X\ ^ X2,X\ € X,X2 £ X. 

MW(G) - MW{Gll) = %i,2/2); yi # ^2,2/1 € F, 2/2 € Y. 

Proof: We again present only the proof for a vertex replacement in Y. 

Let Ml be a maximum weighted matching of Both M and Mi are perfect matchings. 
Therefore, T(M, Mi) contains a path from the copy of yi to 2/2 and 
MW{G) - MW{Gl\) > d(2/i,2/2). 

The shortest path from the copy of 2/1 to 2/2 can transform M into a matching of G ^\ . 

This means that MW(G) — MW{G^\) = d( 2 /i, 2 / 2 ) ^ 

All the proofs given above are constructive. This means that the result of the all-pair-shortest-path 
problem can be used to construct an optimal solution for every defined sensitivity problem. 

Theorem 1 The optimal objective value of the problems G[x,y], Gxy, Gx, Gy, G^^, G%\, G^, 
and G^ for all x,xi,X 2 G X, y,yuV 2 € Y can be computed in time O(n^). 

Proof: A maximum weighted matching M of G is computed in time O(n^). The construction of 
G' takes O(n^) time. The distance matrix is computed in time O(n^). All optimal objective 
values of the defined problems can be computed in 0(n) from the distance matrix, the weight 
matrix and M as shown in lemmas 2-6. □ 

3. Computational results 

We have analyzed our algorithm with different randomly generated problems. The following table 
shows run time results if we compare the new approach to solving each problem by calling an 
assignment routine. This algorithm is taken from [AKP91]. The running times are taken from a 
SUN sparcstation 20 with 128 MB RAM using the gnu-compiler gcc, version 2.7.2. 

The column headings have the following meaning: 
n number of vertices in X 

matching CPU seconds for the solution of the matching problem 

sensitivity CPU seconds for the solution of all sensitivity problems 

m-calls CPU seconds for the solution of all sensitivity problems by calling matchings 

factor mcalls / sensitivity 

To solve all defined sensitivity problems of size n = 200 means that we had to compute 121000 
matching problems. With our new approch we have been 181 times faster. 

These results can be used very efficiently within the analysis of manpower scheduling problems 
[K197]. The computed distance matrix allows various detailed analysis for situations where workers 
with specified skills should be replaced or hired. The costs of the absence of a worker or the benefits 
of training a skill are other examples of information which can be directly read off from the matrix. 
Of course, all these results can be achieved by solving O(n^) different weighted matching problems. 
In our approach we only have to compute the distance matrix once and then we have got a solution 
for all the enumerated problems which in terms of complexity is a substantial improvement. 
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n 


matching 


sensitivity 


m-calls 


factor 


10 


0.005779 


0.009944 


0.359227 


36 


20 


0.002880 


0.075002 


3.009560 


40 


50 


0.011764 


1.258078 


68.75107 


55 


75 


0.014245 


4.111623 


297.9279 


72 


100 


0.020882 


7.765758 


785.04356 


101 


150 


0.050458 


25.60069 


4524.2150 


176 


200 


0.063156 


60.90285 


11062.534 


181 



Table 1: randomly generated matching problems (weights between 0 and 31) 
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Summary: In this paper we find a particular partition of the vertex set of claw-free strongly chordal 
graphs in which each element is a clique, and we show that the adjacency graph of these cliques is 
a tree. In particular, the presented results imply the existence of an ordering of the vertices, and a 
corresponding edge orientation, such that each directed path is contained in at most two maximal 
cliques. As shown by the authors in previous works, this allows to give performance guarantee 
approximation results on a wide class of optimization problems. 

1. Introduction 

In this paper we find a particular partition of the vertex set of claw-free strongly chordal (CFSC) 
graphs in which each element is a clique, and we show that the adjacency graph of these cliques is a 
tree. This result extend significantly previous achievements given only for subclasses of CFSC graphs 
[3, 4, 6]. In particular, the presented results imply the existence of an ordering of the vertices, and 
a corresponding edge orientation, such that each directed path is contained in at most two maximal 
cliques. As shown by the authors in previous works, this allows to give performance guarantee 
approximation results on a wide class of optimization problems [3, 4, 5, 6]. 

The main contributions of our work are as follows. Firstly, we characterize a maximal clique of G 
based on the structure of its neighbourhood. Defining an adjacent path P of a maximal clique Q 
of G as a chordless path such that each vertex of P is adjacent to Q, we show that each maximal 
clique of G has no adjacent path of length greater then 2. We show how this characterization implies 
that the subgraph G{V\Q) decomposes into a set of mutually disjoint CFSC graphs, which are also 
mutually disconnected. Based on the previous property, we give a recursive procedure to partition 
the vertex set of G into a family A of cliques of G. Moreover, we show that the adjacent clique graph 
Ga of G corresponding to the partition A is a tree. Note that, this property does not hold for a 
general set of cliques covering all the vertices of G [8], nor for the clique graph Gc of G [1, 3]. 

2. Basic notations and definitions 

A graph is a pair G = (V^P), where V is a finite set of n = |y| elements called vertices, and 
EC {(x,y) £ V X V \ X ^ y} is & set of m = \E\ unordered vertex pairs called edges. For two 
distinct vertices x, y we say that x is adjacent to y (or equivalently, y is adjacent to x) if {x, y) £ E\ 
otherwise, they are said to be independent. 

Given G = (V, E) and a subset X CV, the graph G{X) = {X, E{X)) is the subgraph of G induced 
by X if E[X) = {{x,y) £ E \ x,y £ X}. Two subgraphs G{X), G(Y) of G are (mutually) disjoint if 
X n y = 0. Two disjoint subgraph G(X), G(Y) are adjacent if there exist an edge (x,y) £ E, with 
X £ X and y £Y. 

If we orient an existing edge (x,y) of G = {V,E) from x to y, we write {x ^ y). If all edges in E 
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are oriented, they are called arcs, and the resulting graph is called an oriented (directed) graph. 

A path of length k in an undirected graph G — (V, E) (a directed graph H = (V, A)) is a sequence 
of distinct k vertices {xi,X2, . . . ,Xk) of V, such that (xi,a;2),. . {xk-i,Xk) G E ((xi -> X2),..., 
{xk-i — > Xk) G A). 

An orientation of a graph G = (V, E) is the set of arcs A (or, generally speaking, the oriented graph 
H = (V^, ^)) such that \A\ = \E\, and for each (x,y) G E either (x y) e A, or {y x) e A. 
An orientation H = (V, A) of G = (V, E) is acyclic if there is no path {xi,X2,- • in H such that 

X\ =■ Xk> 

An orientation H = (V,A) of G = (V^E) is transitive if for each {x y),{y z) e A then 
(x z) G A; hence, a transitive orientation is also acyclic. A graph G = (V, E) which is transitive 
orientable is called a comparability graph. Comparability graphs can be recognized and transitively 
oriented in polynomial time [ 8 ]. 

A graph G is connected if for all two distinct vertices u^v £ V there is a path (u, . . . , in G, 
otherwise G is disconnected and formed by p > 2 disjoint maximal connected subgraphs called 
connected components. We say that two disjoint subgraphs G(X),G(F) of G are linked if there is a 
path (x, . . . , y) in G, with x £ X and y £Y, otherwise they are unlinked. 

A path (xi, X2, . . . , Xk) is a cycle of length k if also (xi^Xk) G E. 

An even (odd) cycle (path) is a cycle (path) of even (odd) length. 

A tree is a connected graph without cycles. 

A chord in a path (xi,X2, .. .,Xk) is an edge (x», Xj) £ £*, such that Xi and Xj are not adjacent in the 
path, i.e. \i - j\ ^ 1 . 

Being the distance d(u^v) between two vertices u and v the number of edges of a shortest path 
between u and u, a chord in a even cycle (xi, . . . ,X2n) is an odd chord if the distance in the cycle 
between Xi and xj is odd. 

A path (cycle) of a graph G is chordless if the subgraph induced by its vertices is a path (cycle); 
in particular, a hole is a chordless cycle. Let Pk and Ck denote the chordless path and cycle with k 
vertices. 

A graph G is a chordal graph if each cycle in G of length at least 4 has a chord; that is, G contains 
no induced hole of length greater than 3 . A graph G is a strongly chordal graph if G is chordal and 
each even cycle in G of length at least 6 has an odd chord. 

A set V' C V oi vertices is a clique of G if the subgraph G(V) of G induced by V' is a complete 
graph. A maximal clique is a clique not properly contained in any other clique. A maximum clique 
is a (maximal) clique with largest number of vertices among all cliques. 

Given a graph Go = (V^, Eq) with r vertices {ui, . . . , u^}, and n disjoint graphs Gi = (Vi, Ei ),..., Gr = 
(K, Er), the composition graph G = Go[Gi, ..., Gr] is the graph G = (V, E), such that V = Ui^=i Vi, 
and E = IJLi Ei U {(x,y) \ x £ Vi,y £ Vj and (vi^Vj) £ Eq}. Let us call Go the outer factor, and 
Gi, . . . , Gr the inner factors. Vice-versa, a graph G = (V, E) is decomposable if it can be expressed 
as a non trivial composition of some of its induced subgraphs, that is if there exists a partition of 
V into r < n nonempty pairwise disjoint subsets Vi, ... , K, such that G = G(R)[G(Vi ), . . . , G(Vr)], 
for any set of representatives R = {ui , . . . ,Ur},Uj G VJ. Such a partition is said to induce a proper 
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decomposition of G. 

The clique graph of a graph G is the intersection graph of the family of the maximal cliques of G. 
Given a graph G = (V, E), the (open) neighborhood of a set X of vertices of G is the set Ng{X) = 
{y ^ y\X \ 3x £ X : (x, y) G Ng[X] = Ng{X) U X denotes the closed neighborhood of X. 

A graph G is a claw-free chordal graph if it is chordal and there is no subgraph of G isomorphic to 
the (or claw) graph. 

An incomplete trampoline (or incomplete sun) is a chordal graph G of 2n vertices, for some n > 3, 
whose vertex set can be partitioned into two sets, W = {wi, . . . , u;„}, and U = {ui, . . . , Un}, such 
that W is an independent set and, for each i and j, Wi is adjacent to uj if and only if i = j or 
z = ( j + 1) mod n. 

A trampoline (or sun) is an incomplete trampoline G in which G(U) is a complete graph. A 3-sun 
is a trampoline with n = 3. 

3. Characterization of cliques of a CFSC graph 

In this section, we characterize the maximal cliques of CFSC graphs. This characterization is based 
on the analysis of the structure of the neighborhood of a maximal clique. Let us start giving some 
definitions. 

Definition 1 Given a maximal clique Q C V of a graph G = (V,E), a maximal clique K of the 
induced subgraph G(V\Q) is a maximal adjacent clique ofQ, if the set X — KC\Ng{Q) is non-empty, 
and there is no other maximal clique J ofG{V\Q), such that X C {J Cl Ng{Q)). 

Definition 2 Given a maximal clique Q ofG, an adjacent path Pj{Q) = (xi, . . . , Xj) ofQ in G(V\Q) 
is a chordless path, such that each x» G Pj{Q) belongs to Ng{Q). 

Lemma 1 Given a maximal clique Q of a chordal graph G = (V,E), a chordless path Pj = 
(xi, . . . ,Xj) in G{V\Q), such that X\,Xj G Ng{Q), is an adjacent path of Q. 

Proof: Since Pj = (xi, . . .,xj) is a chordless path in G{V\Q), there is no edge between (xi,Xi+rn) 
in E, for i = l,...,j - m and j > m > 2. Hence, being G chordal, for each vertex Xj G Pj there 
exists an edge (xi,y) G E with y G Q, otherwise the subgraph G{Pj U Q) contains a chordless cycle 
of length at least 4. □ 

In the following, generally speaking, we refer to Pj{Q) as an adjacent path of Q of length j. 

Lemma 2 Let Q be a maximal clique of a claw-free chordal graph G = {V,E). There exist two 
intersecting maximal adjacent cliques of Q if and only if there exists an adjacent path P^iQ) = 
(xi, X 2 , X 3 ) of length 3 in G{V\Q). 

Proof: Let Ki,Kj be two intersecting maximal adjacent cliques of Q in G{V\Q). In the subgraph 
G[KiUKj), there exist at least two vertices X\ G {Ki\Kj) and X 3 G (Kj\Ki), such that (xi,X 3 ) ^ E, 
otherwise Ki or Kj is not maximal clique in G{V\Q). 




145 



By definition of maximal adjacent clique, KinNa{Q) gL K^riNciQ) and KjC^NciQ) (jL Ki^NQ{Q)- 
Moreover, the case in which Ki fl Nq{Q) = Kj n Ng{Q) does not occur. In fact, if this is the case, 
there exist two vertices y G (i^i fl Kj) and z ^ Q such that {xi,y), (x3,y), (y,z) 6 E, and, since 
{xi,xs) ^ E, it results that {x\,z) 6 E and/or (0:3,2:) € E otherwise the subgraph G{xi^xz^y^z) is 
a claw. But this cannot occur because Ki fl Ng(Q) = Kj fl Ng(Q) and xi ^ Kj and xs ^ Ki. 
Therefore, there exist two vertices Xi € Ng{Q) H (Ki\Kj) and 0:3 G Ng{Q) H (Kj\Ki) otherwise Ki 
and Kj are not maximal adjacent cliques of Q; then, by Lemma 1, PsiQ) is an adjacent path of Q. 
Vice-versa, given Q, let us suppose that there exists an adjacent path Pz{Q) = {xi,X2,xs) of Q in 
G(V\( 5 ). Since {xi,xs) ^ E, there is no maximal adjacent clique containing PsiQ)] hence, there 
exist at least two maximal adjacent cliques Ki^Kj^ containing {a:i,o:2} and {0:2, X3}, respectively. 
Therefore, Ki fl Kj ^ 0 . □ 

Lemma 3 Given a claw-free chordal graph G = (V, E), each maximal clique Q in G, for which there 
exist two intersecting maximal adjacent cliques, contains at least 3 vertices. 

Proof: Supposing w.l.o.g G connected, the number of vertices of any maximal cliques of G is 
obviously greater than 1 . Now, by contradiction, let us suppose that there exists a maximal clique 
Q = {y^^} of type B. By Lemma 2, there exists an adjacent path PsiQ) = {xi,X2,xs) of Q in 
G{V\Q). Hence, denoted by E' C E the set of edges between PsiQ) and Q, it results \E'\ > 3 . If 
\E'\ = 3 , there are two cases: both xi,x^ are adjacent to a vertex y ^ Q] Xi,xs adjacent to different 
vertices y,z e Q; but, in the first case, the subgraph G{{y,z,Xi,Xz}) is a claw, while in the latter, 
the subgraph G{{xi,X2,xs,y, z}) contains a chordless cycle of length 4 . If \E'\ > 3 , at least one 
vertex of Ps{Q) is adjacent both to y and 2:, hence, Q is not a maximal clique of G. □ 

Proposition 1 Given a maximal clique Q of a claw-free chordal graph G = i^^E), if Ps{Q) = 
{xi,X2, X3) is an adjacent path of Q, then there is no vertex y £ Q, such that both (y, Xi), (y, X3) G E. 



Proof: By contradiction, let us suppose that there exists a vertex y £ Q, such that (y,Xi), (y,X3) G 
E. In this case, since (xi,X3) 0 E, it results (y,X2) G E, otherwise the subgraph G{{xi,X2,Xs,y}) 
of G is a chordless cycle of length 4 . 

By Lemma 2, there exist two intersecting maximal adjacent cliques of Q, and hence, by Lemma 3 , 
IQI > 3 . In particular, each vertex w £ Q\{y} must be adjacent at least to Xi or X3, otherwise 
the subgraph G{{y,xi,xs,w}) is a claw. Moreover, being Q a maximal clique, neither xi nor X3 is 
adjacent to all the vertices of Q; hence, there exist at least two vertices w,z £ Q\{y}, such that 
(it;, Xi), (^, X3) G E, and (u;, X3), (2;, xi) ^ E. Let us show that this cannot happen proving the 
proposition. 

Being (xi,X3), (2:,xi), (it;,X3) ^ E, the subgraph G{{xi,X2,£3, z,w}) contains no chordless cycle of 
length at least 4 , if and only if both {w, X2), (2;, X2) £ E. If Q = {y, w, z}, then Q U {X2} is a clique. 
Therefore, let us suppose that Q = {y, w, z} U K, with ^ 0 . We have shown that each vertex 
of Q belongs to A^g({xi, X3}), hence, in particular, this is true for all u £ K. ^From this fact, it 
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also results that (^,0:2) G E", as shown next: if u is adjacent both to xi and X3, then (u,X2) € E, 
otherwise the subgraph G{{xi,X2,xz,u}) is a chordless cycle of length 4 ; if u is adjacent only to 
xi (xa), then {u,X2) G E, otherwise the subgraph G{{%Xi,X2,z}) {G{{w,X2,xz,u})) is a chordless 
cycle of length 4 . Hence, again, Q U {X2} is a clique. □ 

Theorem 1 Given a maximal clique Q of a claw-free chordal graph G = (V,E), if there exists an 
adjacent path PsiQ) = (xi,X2,X3) of Q, then there exist at least three vertices y,w,z of Q such that 
the subgraph G{Ps(Q) U {y, z)) is a 8 -sun. 

Proof: By Proposition 1, there is no vertex of Q adjacent both to x\ and X3. Hence, there exist at 
least two vertices y,z oiQ such that {y,xi),{z,xz) G E, and {y,xz),{z,xi) ^ E. Therefore, since 
(xi, X3) qL E, the subgraph G{Pz{Q) U {2/, z}) contains no chordless cycle of length at least 4 if and 
only if [y,X2),{z,X2) G E. 

By Lemma 2, there exist two intersecting maximal adjacent cliques of Q, and hence, by Lemma 3 , 
the set Q\{y,z} is not empty. In particular, if there exists a vertex w G Q\{2/, 2:}, such that 
w ^ NoiPsiQ)), the subgraph G{P3{Q) U {y,w,z}) is a 3 -sun. 

Let us show that such a vertex w exists, otherwise Q is not a maximal clique. By contradiction, let 
us suppose that each vertex v G Q\{y,z} belongs to Nq{P3{Q)). First, v cannot be adjacent both 
to Xi and xz, by Proposition 1 ; if (v,Xi) G E and {v,Xz) 0 E ((^,0:3) G E and (u,a;i) ^ E) the 
subgraph G{{v,Xi,X2, z}) {G{{v,Xz,X2,y})) contains no chordless cycle of length 4 , if and only if 
(v,X2) G E. Hence, if u G NciPsiQ)), then {v,X2) G E, and then, Q U {X2} is a clique. □ 

Now, let us consider G to be strongly chordal. It is known that a graph G is strongly chordal if and 
only if it is chordal and does not contain trampolines (Chang [2], Farber [ 7 ]). Since the 3 -sun is the 
only claw free trampoline, the following proposition holds. 

Proposition 2 A claw- free chordal graph G is strongly chordal if and only if it does not contain any 
subgraph isomorphic to the 3 -sun. 

^From Proposition 2 and Theorem 1 it follows that: 

Corollary 1 Each maximal clique of a CFSC graph has no adjacent path of length greater than 2 . 
Therefore, by Lemma 2 , it results: 

Theorem 2 For each maximal clique Q of a CFSC graph G, the maximal adjacent cliques of Q in 
G are mutually disjoint. 

4. The subgraph of the maximal adjacent cliques 

Given a maximal clique Q of a CFSC graph G = (F,E), let JC= {Ki, . . . ,E^} be the set of all 
the maximal adjacent cliques of Q in G. In this section, we study the structure of the subgraph 
G{Ki U • • • U Kq) induced by the vertices of all the maximal adjacent cliques of Q. 

In particular, we will show that G{Ki U • • • U Kg) is disconnected and decomposable into a set of 
connected components. 
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Lemma 4 Given a maximal clique Q of a CFSC graph G = (V^E), for any couple of maximal 
adjacent cliques Ki^Kj ofQ, there is no pair of vertices Ui G {Ki), Uj £ (Kj), such that (ui,Uj) e E. 



Proof: By Theorem 2, K{ fl Kj = 0. By contradiction, let us suppose that there exist Ui 6 Ki, 
Uj e Kj, for which {ui,Uj) € E. Now, if both Ui,Uj belong to Ng{Q), there exists another maximal 
adjacent clique K of Q, containing Ui and Uj, intersecting both K{ and Kj, but this cannot happen 
by Theorem 2. 

Therefore, w.l.o.g., let us suppose that does not belong to Ng{Q)> Since Ki,Kj are two maximal 
adjacent cliques of Q, there exist at least two vertices Vi £ (KiONGiQ)) and Vj £ (KjnNGiQ)), such 
that both edges (vi,Uj) and (ui,Vj) do not belong to E, otherwise Ki and/or Kj are not maximal 
adjacent cliques of Q. But, in this case, (vi,Ui,Uj,Vj) is a chordless path, and by Lemma 1, since 
Vi,Vj £ Ng(Q), both Ui,Uj £ Ng{Q), contradicting the hypothesis that Ui ^ Ng{Q)- □ 

Theorem 3 Given a maximal clique Q of a CFSC graph G = (V, E), and the set K = {K \, . . . , Kq\ 
of all the maximal adjacent cliques of Q in G, the subgraph G{K\ U • • • U Kq) is disconnected and 
decomposable into q connected components G{Ki), for i = 1 , . . . ,q, where each G{Ki) is a complete 
graph. 

Proof: By Lemma 4, there is no edge in E between vertices belonging to different disjoint maximal 
adjacent cliques of Q, hence the subgraph G{Ki U • • • U Kg) is disconnected and decomposable into 
q connected components G{Ki), fov i = 1, . . . ,q. In particular, G(Ki), for i = 1 , . . . ,q, is complete. 
□ 

By now, we have characterized the structure of the subgraph G{Q), with Q = Q U U • U Kq, and 
shown how it decomposes. 

Figure 1 shows the whole structure of a CFSC graph, where G{Q) is the gray subgraph of the graph. 
Let us analyze G{Q) (the remain part of the graph will be discussed later). It contains a maximal 
clique Q with its maximal adjacent cliques Ki, K 2 , K 3 , all mutually disjoint. Note that G{Ki UK 2 U 
Ks) is a disconnected subgraph of G, formed by 3 connected components G{Ki), G{K 2 ), G{Kz) which 
are complete subgraphs of G. 

5. Structural properties of CFSC graphs 

We have analyzed the structure of the subgraph G[Ki U • • • U Kq), induced by the maximal adjacent 
cliques of a given maximal clique Q of G; by definition, this subgraph contains G{Ng{Q)) as a 
subgraph. In particular, we have shown that there is no edge in G between any two vertices of 
different disjoint maximal adjacent cliques of Q. 

Moreover, given Q = QU KiU ‘ • - U Kq, in the sequel we show that there is no path between any two 
vertices V{,Vj belonging to different maximal adjacent cliques Ki,Kj of Q, respectively, containing 
vertices in V\Q. Therefore, it follows that, if the subgraph G(V\Q) is adjacent to more than one 
maximal adjacent clique of Q, G(V\Q) is disconnected, and each its connected component is adjacent 
to at most a single maximal adjacent clique of Q in G. 
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Figure 1: The structure of a CFSC graph. 



Let us start to show that: 

Lemma 5 Given a maximal clique Q of a CFG graph G = (V, E), there is no path P between any 
couple of vertices, belonging to different maximal adjacent cliques Ki, Kj of Q in G, such that P 
contains vertices in V\Q. 

Proof: By contradiction, let us suppose that there exists a path Pq between Vi and vj, containing 
vertices in V\Q. Moreover, by definition of maximal adjacent cliques, there exists at least a path 
Pi between Vi and Vj, formed only by vertices Ki U Kj and at least a vertex of Q. It is simple to 
see that the subgraph G(Pq U Pj) contains a chordless cycle of length at least 4, since by Lemma 4 
{vi, Vj) ^ E, and, by definition of Pq, there exists at least a vertex v e Pq such that v 0 Nc{Q)- □ 

Hence, from Lemma 5, it follows that: 

Proposition 3 Given a maximal clique Q of a CFG graph G = (V,E), each connected component 
of the subgraph G{V\Q) is adjacent to at most one maximal adjacent clique of Q. 

Let Gi = {Ui, Fi) be the collection (eventually empty) of connected components of G{V\Q) that are 
adjacent to the maximal adjacent clique Ki in G. ^From Proposition 3, it follows that: 

Theorem 4 Given a maximal clique Q of a CFG graph G = {V, E), the subgraph G{V\Q) is de- 
composable into q mutually disjoint and unlinked subgraphs Gi, . . . , Gq, such that if Gi is not empty 
it is adjacent to one and only one maximal adjacent clique Ki of Q. 

Let us denote with Hi = (Vi, Ei), for i = 1, . . . , the subgraph G{Ui U Ki). 

Since Ki is a maximal clique of G{V\Q), it follows that: 
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Corollary 2 K{ is a maximal clique of Hi. 

Let us complete the analysis of the structure of a CFSC graph G and its decomposition, starting 
from a maximal clique Q of G (see Figure 1 ). We have yet shown how G{Q) can be decomposed; 
G{V\Q) is decomposable into q — 3 disjoint and unlinked subgraphs Gi, . . . , G 3 , with Gi adjacent 
only to the maximal adjacent clique Ki of Q, and in particular G\ being disconnected and formed 
by 2 connected components which are both adjacent to Ki. 



6. Partition algorithm 

In this section, following the structural properties of CFSC graphs analyzed in the previous sections, 
we give a partition algorithm that partitions the vertex set F of a CFSC graph G = (V, jF) into 
cliques Q* of G, decomposes G, and constructs the adjacency graph Gd[G] = (Vd^Ed) of these 
cliques, where each vertex Xi € Vd represents a clique Qi of G, and {xi^xj) G Ed if and only if the 
subgraphs G{Qi),G{Qj) are adjacent in G. Let us call Gd[G] the decomposition graph of G. 

The algorithm operates according to the following considerations. Given a CFSC graph G = {V, E) 
and a maximal clique Q of G, let us considered the subgraphs G(Q) and G{V\Q)^ where we recall 
that Q = (Q, Ui^i U • • • U Kq), being Ki,...,Kq the maximal adjacent cliques of Q. 

Theorem 3 tell us that G{Q\Q) is disconnected and decomposable into q connected components 
G{Ki), for i = 1, . . . , in particular, G{Ki) is obviously a complete subgraphs. 

Therefore, partitioning Q into the cliques Q, Ki,. . .^Kq^ we have that the decomposition graph 
Gd[G{Q)] of G(Q) has X = {x,xi, . . . jXq} as vertex set, while the edge set is composed only 
by edges between the vertex x and the other vertices of X. 

Theorem 4 tell us that G(V\Q) is decomposable into a set of disjoint and unlinked subgraphs each 
one adjacent to exactly one maximal adjacent clique of Q. In particular. Corollary 2 states that K{ 
is a maximal clique of the connected subgraph Hi = G(Vi U Ki). Therefore, since, by definition, 
each subgraph of a CFSC graph is itself claw-free strongly chordal, we can independently reapply 
the previous considerations to each subgraph i/j, in a recursive fashion, in order to decompose Hi. 
W.l.o.g. let G be connected, otherwise the following has to be applied for each connected component 
of G. The algorithm starts considering a maximal clique Q of G, for example, a maximal clique of 
a simplicial vertex of G, where a vertex x of a graph G = (V, E) is called simplicial if the induced 
subgraph G{{y G F | (y,x) G jF}) is a clique. It is known that every chordal graph has a simplicial 
vertex, and this is can be found in linear time [ 8 ]. 

The algorithm, starting from Q, constructs the sets Vd = {x} and Ed = 0, with x representing Q 
in the decomposition graph Gd[G]. Then the algorithm partitions V, decomposes G, and returns 
Gd[G], starting from Q, recursively in the following way. It identifies the maximal adjacent cliques 
{Ki, ... ,Kq} of Q. It adds a vertex Xi to Vd and an edge (x,Xj) to Ed for each maximal adjacent 
clique Ki, with i = l,...,q. At this point, the algorithm has decomposed the subgraph G{Q). Then, 
the algorithm decomposes G(V\Q). In order to do that, it decomposes the subgraph Hi as a, new 
separated CFG graph, starting from Ki as a. maximal clique of Hi. Formally the partition algorithm 
is given next. 
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graph function Partition_CFSC_Graph(graph: G = iy,E))\ 
begin 

let Q be a maximal clique of G\ let x be the vertex related to Q; 
let Vd := {ic}; Let Ed := 0; 

Partition(G, Q,x); 
return(GD = {Vd^Ed)) 

end 

procedure Partition(graph: G, clique: Q, vertex: x); 
begin 

let {Ki, . ..,Kq} be the set of maximal adjacent 
cliques of Q in G; 
for each Ki G {Ki, . . . , Kg} do 
begin 

let Xi be the vertex related to Ki; 
let Vd := Vd U {xi}; let Ed := Ed U {(x, Xj)} 
end 

\etQ:=QU{KiU-^UKg); 
for each Ki € {Ki, . . . , Kh] do 
begin 

Let Gi = {Ui, Fi) be the subgraph of G, formed by the connected 
components of G(V\Q) adjacent to Ki in G; 

Let Hi = G(Ui U Ki); 

Partition (i/j, Ki,Xi) 

end 

end 

As for time complexity, the set of maximal cliques of G can be found in G(n -f m) time [8]. Much 
more time is required to find the maximal adjacent cliques of a given maximal clique. Using a simple 
list to represent a set of vertices, and considering that 0{n) time is required to compare two different 
sets, all the maximal adjacent cliques can be found in G(n^), since at most we have to mutually 
compare G(n) maximal cliques. Since this has be done at most n times, the overall time complexity 
of the partition algorithm is G(n^). 

Next, we investigate the structure of the decomposition graph Gd = (Vd, Ed) of a CFSC graph 
G = (V,E) in order to acyclically orient G. 

If G is a connected CFSC graph, it is simply to show that, by construction, the decomposition graph 
Gd is connected, too. ^From Theorem 2 it follows that in a CFSC graph all the maximal cliques 
have only mutually disjoint maximal adjacent cliques; therefore, it follows that: 

Theorem 5 The decomposition graph Gd of a CFSC graph G is a tree. 




151 



This latter result allows us to acyclically orient G in a very effective way, as shown in the sequel. 
Let us start to acyclically orient the decomposition graph Gd = (V^, Ed)- Being Gd a tree, and hence 
a special comparability graph, if we orient it according to any one of its transitive orientation, it results 
that each directed path is composed by 2 vertices. As a consequence, the oriented decomposition 
graph induces an acyclic orientation on G, in which each directed path is contained in at most two 
maximal cliques of G. 

7. Conclusions 

In this paper we have found a particular partition of the vertex set of claw-free strongly chordal 
graphs in which each element is a clique, and we have shown that the adjacency graph of these 
cliques is a tree. In particular, the presented results imply the existence of an ordering of the 
vertices, and a corresponding edge orientation, such that each directed path is contained at most in 
two maximal cliques. As shown by the authors in previous works, this allows to give performance 
guarantee approximation results on a wide class of optimization problems. 
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Extreme Rays of the Cones of Subadditive Functions 



Vladimir A. Shlyk 

Summary: The extreme rays of the cones of functions defined on the set N = {0, 1, . . . , n} and 

subadditive relative to the usual addition or addition modulo n -f 1 are studied. A large class of 
extreme rays of the cone of monotone subadditive functions and all extreme rays of the cone of 
concave functions on N are found. Established set-theoretic relations between the cones allow to 
lessen the area where the search of extreme rays of the first two cones is worthwhile. 

1. Introduction 

Let N he a. set closed with respect to a binary operation 0, which can be partially defined, and let 
R be the set of real numbers. 

A function F : N Ris called subadditive on the additive system (AT, 0) if 

F{x) + F{y) > F(x 0 y) (1) 

for all x,y e N for which x^y is defined. 

The interest to subadditive functions arose from the theory of integer linear programming (ILP). R.E. 
Gomory and E.L. Johnson [1, 2] were the first to establish the tight connection between subadditive 
functions and polyhedra on groups. Later, in the works of C.A. Burdet, E.L. Johnson, J. Araoz, R.G. 
Jeroslow, L.A. Wolsey, S.S. Lebedev, O.K. Sheinman, and others, the theory of subadditive duality 
for the ILP problems was constructed that has led to subadditive algorithms for the ILP problems. 
It was found that many important features of ILP problems, such as solvability, optimal solution 
value, valid cuts, and others, can be expressed in terms of subadditive functions. A brief survey of 
these results can be found in [3], where the problem of obtaining description of extreme rays of the 
cones of subadditive functions on different algebraic structures seems to be formulated explicitly for 
the first time. 

Subadditive functions are of interest not only in view of optimization problems. They form a class 
of functions one step more complicated than concave ones and are globally abundant, since the 
inequality (1) is of the most simple and encountered type of nonlinear relations. For example, every 
metric can be considered as a subadditive function on the set of pairs of space elements [4]. 
Subadditive functions on an additive system (AT, 0) constitute a closed convex cone in the space 
R^^^. Apparently, this cone will be described if its extreme rays could be characterized. The known 
result is a consequence of the subadditive characterization of facets of the master Gomory polyhedra 
on a finite abelian group G, which are defined as the sets 

F(G, b) = {x e RP \ QiXi = 6, X > 0, x integer} 

where G is a group of order n + 1 with the addition operation 0 g and 6 is a nonzero element 
of G. According to subadditive characterization, the coefficient vector of every nontrivial facet of 
the polyhedron P(G, b) for every b is an extreme ray of the cone of subadditive functions defined 
on the group (G, 0g). From the generalizations obtained thereafter for polyhedra on semigroups 
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and additive systems, similar results follow for subadditive functions on these algebraic structures. 
Extreme rays of the cones are not exhausted by the facet coefficient vectors, even for the case of 
polyhedra on groups. 

We restrict ourselves to the case when AT is a finite set of integers N = {0, 1, . . . , n} but consider 
two different addition operations: usual number addition -f partially defined on N, and addition + 
modulo n + 1. The cones of subadditive functions corresponding to these operations are denoted by 
S(n) and 5G(n), respectively. Together with these cones three lesser cones are studied: 

5+(n) - the cone of nonnegative subadditive functions on iV, 

SM(n) - the cone of monotone subadditive functions on AT, and 
C(n) - the cone of concave functions on N. 

The set-theoretic relations between these cones are established, a large class of extreme rays of the 
cone SM(n) and all extreme rays of C{n) are found. This allows to lessen the area where the search 
of extreme rays of the cones 5(n) and SG(n) is worthwhile. 

2. The cone of subadditive functions S{n) 

The cone of subadditive functions is the set 

5(n) = {F = (F(0), F(l), F(n)) 6 | F(i) + F(j) >F{i+j), i, j, i + j€N}. 

It is not hard to see that F is extreme in S{n) only if F{0) = 0. Hence, when searching extreme 
rays, we can assume that F(0) = 0 for all F G S{n). The following proposition shows that the 
cone S{n) is not pointed. It means that its maximal subspace L5(n), called lineality space of 5(n), 
is nonzero. Let E be the function defined as E{i) = i for every i e N. Denote by L{E) the line 
{F e I F = XE, A G F} generated by E. 

Proposition 1 The lineality space of the cone S{n) is equal to L{E). 

Proof. It is clear that L{E) C L5(n), and if F G S(n) and F ^ E, than F{i) > 0 for all i e N. If 
LS(n) ^ L{E) then there exists a function F G LS{n) such that F(i + 1) — F{i) ^ F(l) for some 
i,l < i < n - 1. It means that F, in distinction to F, has at least two slopes. Since F G LS{n) the 
same is true for the function — F. Now one can easily derive a contradiction with subadditivity of 
-F. □ 

Denote by the nonnegative orthant of 

Corollary 1 Every function F G 5(n) can be decomposed in a sum of two functions F = Fl + F+, 
where Fl G L{E) and F.^ is a nonnegative subadditive function. In other words, S{n) = L(F)+5+(n) 
where S+{n) = S{n) D and S^{n) is a pointed cone in 

Thus, the problem of obtaining a description of the cone of all subadditive functions S(n) can be 
reduced to the search of extreme rays of the cone of nonnegative subadditive functions 

5+(n) = {F G |F(0) = 0, F(0 + F(;) > F(z + ;), i + j G AT \ {0}}. (2) 
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Extreme rays of any cone of dimension k defined by a system of linear inequalities are exactly those 
cone elements that turn k — 1 linear independent inequalities into equalities. These inequalities are 
called basic. An inequality that is turned into equality by a given cone element is called active for 
this element. For example all inequalities of the type (1) from (2) are active for the extreme ray E 
of 5+(n). When searching extreme rays, functions F and aF for a > 0 can be identified. 

Proposition 2 For every function F being extreme ray of S+{n), except E, there exists an integer 
i^2 < i < n, such that F{i) = 0. 

Proof Let F be extreme in 5+(n) and F ^ E. On the contrary, suppose that all its basic inequalities 
are of the form (1). Let M be the coefficient matrix of these inequalities together with the equality 
F{0) = 0, and let F^ be the vector (F(0), F(l), ..., F(n)) transposed. Then MF^ = 0. On the other 
side, ME'^ = 0 and the system MX = 0, up to a multiplier, has unique solution. Hence F = aE for 
some a > 0. □ 

A lifting rule for extreme rays of S+{n) can be formulated. 

Proposition 3 Let a function F be extreme in the cone S+{n) and let the function G : + 

1} -> jR 6e an extension of F defined by the rule: G{i) = F{i) for i e N, and G{n + 1) = 0 or 
G(n + 1) = min {F(n -\-l — k) -F(A;)}. Then G is an extreme ray of the cone 5+(n + 1). 

The proposition can be proved by adding a row of coefficients of arbitrary equality containing G(n+1) 
to the coefficient matrix of the basic inequalities of the function F and showing that the rank of the 
new matrix is equal to n + 1. 

3. The cone SG(n) of subadditive functions on group 

As for 5(n), it can be assumed that F(0) = 0 for all F € SG{n). Inequalities (3) imply nonnegativity 
of all F € SG{n). Extreme rays with F(0) > 0 can be obtained from those with F(0) = 0 by setting 
F(0) = max|F(i) + F{j) | i, j € AT, i+j = o}. Thus the cone of subadditive functions SG{n) on 
the group (AT, +) can be defined as 

SG{n) = {F € I F(0) = 0, F(i) + F{]) > F(i+j), ij € N} (3) 

where i+j = i-\- j mod(n -I- 1). 

Note that E is an extreme ray of SG{n) again. Since (3) contains more inequalities than (2), 
the inclusion SG{n) C S+{n) holds. The next proposition can be proved in the same fashion as 
proposition 2. 

Proposition 4 If a function F is an extreme ray of the cone SG{n) then there exist i^jeN, i+j > 
n, such that F{i) H- F{j) = F{i^j). 

We attempt to get information about extreme rays of the cones S+{n) and SG(n) by studying two 
lesser cones of subadditive functions. 
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4. The cone of monotone subadditive functions SM(n) 

The cone of monotone (nondecreasing) subadditive functions on N is the set 

5M(n) = {F € i"(0) = 0, F(i)>F( 2 -l), F(i) + F(j) > F{i ^ j). hj.i~^jeN\{0}}. 



We impose the restriction F(0) = 0 from the outset this time. The argumentation is analogous to 
that in the previous sections. 

The inequality F(i) + F{j) > F{i^j) for i + j > n readily follows from the monotonicity of the 
functions F € SM{n). Therefore SM{n) C SG{n). As before, E is an extreme function in SM(n). 
The next two propositions can be proved in much the same manner as propositions 2 and 3. 

Proposition 5 For every extreme function from the cone SM(n), distinct from E, there exists an 
integer i, 2 <i <n, such that F{i) = F{i — 1). 

Proposition 6 Let a function F : N R be an extreme ray of the cone SM{n) and let the function 
G : iV U {n 4- 1} R be an extension of F defined by the rule: G(i) = F(i) for i € N, and 
G(n + 1) = F{n) or G{n + 1) = min^ ^ {F(n + 1 — A;) 4- F{k)}. Then G is an extreme ray of the 

cone SM{n 4- 1). 

Definition 1 A function F : N R is called a step function if F{i) — F{i — 1) = 0 or F(l) for all 
i > 0. 

Theorem 1 All subadditive step functions on N are extreme rays of SM{n). 

Proof. All subadditive functions for all i, 2 < i < n, satisfy one of the following relations: 
F(i) = F(i — 1) or F(i) = F{i — 1) 4- F(l). Satisfied equations can be taken as basic ones together 
with F(0) = 0. It is easy to show that the rank of the coefficient matrix corresponding to the system 
obtained is n — 1. Therefore, if F is subadditive then it is extreme in SM{n). □ 

Not all extreme rays of SM{n) fall into the class of step functions. For n = 6 the following functions 
are examples of extreme rays which are not step functions: (0, 2, 3, 4, 6, 6, 8), (0, 2, 2, 3, 4, 5, 6), 
and (0, 2, 2, 3, 4, 4, 6). Proposition 5 implies that such extreme functions exist in SM{n) for all 
n > 6. The attempts to elucidate their structure have not been successful so far. We present only 
one result that refers to the second of these ’’exceptional” functions. 

Proposition 7 Let m > 3n and let F be an extreme ray of SM{n) such that F{j) > jF{n)/n for 
o>ll j’, 2 < j < n. Let G be the function defined by the rule G{i) = F{i) for i < n and G{i) = iF[n)/n 
for z, n < i <m. Then G is an extreme ray of SM{m). 

Proof The proposition is proved via considering the coefficient matrix of n - 1 basic inequalities 
for the function F and the following m — n equalities: 

2G{n) = G(2n), G{n) 4- G{n 4- 1) = G{2n -hi),..., G{n) 4- G{m - n) = G(m); 



2G(n 4- 1) = G{2n + 2), G(n 4- 1) 4- G{n 4- 2) — G{2n 4- 3), . . . , G{n 4- 1) 4- G{2n — 1) = G{3n). 
One readily checks that its rank is equal to m — 1. □ 
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5. The cone of concave functions C{n) 

Recall that a function F \ N ^ R is called concave if the inequality F{j) -h (fc — z) < F(k) 

holds for all i,j,k e i < k < j. The following criterion is more convenient: a function F is 
concave if and only if F(i) - F{i - 1) > F{i + 1) - F{i) for alH, 0 < z < n. Concave functions on 
N form a cone in As before, when searching extreme rays, we can assume that F(0) = 0. The 
line L{E) is again its lineality space and every concave function F can be decomposed into a sum 
F = Fl-\-F+ where Fl € L(E) and F+ belongs to the pointed cone of nonnegative concave functions 

C{n) = {Fe I F(0) = 0; F{j) > 0, j 6 AT; F(z) - F(i - 1) > F{i + 1) - F(z), 0 < z < n}. 

All extreme rays of C(n) are characterized in the next theorem. 

Theorem 2 Extreme rays of the cone C{n) are: the function E and the functions Fk, 0 < k < n, 
defined as Fk{i) = i/k for 0 <i < k and Fk(i) = for k < i <n. 

Proof Let F € C{n) and F ^ E. We prove that F is a convex combination of E and the functions 
Fl, F 2 , . . . , F„_i defined above. Let zi < Z 2 < . . . < it be all points where F changes its slope, i.e. 
such points z, 0 < z < n, that F(z) - F(z — 1) > F(z + 1) — F(z). 

Decomposition of F. Let F(n) > 0 (this step should be skipped if F(n) = 0). Define two functions 

F^P and Fdn from C{n) as 

F^P(j) = FdnU) = F(j) for 0 < j < it; 

F-P{j) = F(it) + {j - it)(F(it) - F(it - 1)) and 
Fdn(j) = ^,F{it) for it< 3 < n. 

The equality F = + (1 - holds, hence F is a convex combination of F“p and 

Fdn- Note that the function F“^ changes its slope only in t — 1 points and Fdn{n) = 0. 
Decomposition of F“^. F“^ can be decomposed similarly. Eventually there will be one function 
not changing its slope, which belongs to L(F), and some functions like Fdn {Fdn(n) = 0) in the 
decomposition of F. 

Decomposition of Fdn- Define two functions Fdni and Fdn 2 from C(n): 

Fdniii) - Fdnlij) = Fdn(j) for 0 < j < it-l] 

FdniU) = Fdn(it-i) + O' - it-i){F{it-i) - F(it-i - 1)) for it-i <j< it, 

Fdniii) = :f^i^,Fd„i{it) for h < j < n; 

Fdn 2 ii) = for it-i <j<n. 

One can check that Fdn = oj-fdni + (1 ~ oc)Fdn 2 , where a = convex 

combination of Fdni and Fdn 2 - The functions Fdni and Fd „2 are of the same type as Fdn but have 
t-1 slope change points: ii, ^ 2 , • • • , h- 2 ^ h for Fdni, and Zi, Z 2 , . . . , it-i for Fdn 2 - 
Finally we have that Fdn is a convex combination of some functions Fj^ , Fj 2 , . . . , Fi^ which are con- 
tained among the functions Fi, F 2 , . . . , Fn_i since each of them has only one slope change point. 
Therefore, the function F is a convex combination of the functions F, Fj^, Fj 2 , . . . , Fj^. 
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To end the proof it remains to note that none of the functions E, Fk,0 < k < n, can be decomposed 
into a convex combination of others. □ 

6. Connections between cones 

As already noted, SM{n) C SG(n) C S+{n). It could be expected that some extreme rays of 5M(n), 
distinct from E, were extreme rays of SG(n) or S^{n) (and therefore of SG{n)) but the propositions 
2 and 4 shut up this possibility. 

It is not difficult to check that C{n) C S^{n). Since n = 0 in the group {0, 1, ..., n - 1} with addition 
modulo n, obtained extreme rays of the cone C(n) are the Gomory functions, or the coefficient 
vectors of the Gomory facets of the master group polyhedra on this group [1]. As it was noted in the 
introduction, these functions are extreme rays of the cone SG(n — 1). Therefore, the cone C(n) is 
the convex hull of the function E and the subcone of 5G(n — 1) generated by the Gomory functions. 
The next theorem holds: 

Theorem 3 The extreme functions of the cone S+(n), distinct from E, are nonmonotone. The 
extreme functions of the cone SG(n), other than the Gomory functions, are nonmonotone and non- 
concave. 
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Summary: We investigate the complexity and approximability of network flow improvement prob- 
lems. In these problems, one incurs costs for increasing the capacity of an edge, while the goal is to 
achieve a flow of maximum value through the network. We study several improvement strategies. 
Furthermore, we investigate the relationship of network flow improvement problems to network fixed 
cost flow problems, where one incurs a fixed charge for each edge with nonzero flow which does not 
depend on the amount of flow sent over the edge. All fixed cost problems studied in this paper are 
NP-hard to solve. We present various approximation algorithms for the problems under study. 

1. Introduction 

Minimum cost flows play an important role in operations research. They can be used to model a 
large number of transportation, logistics and communication network design problems. Usually, the 
flow costs are linear or convex, that is, one is charged Ce(t) units of money if sending t units of flow 
over edge e, where each Cg is a linear or convex function. 

However, linear or convex cost functions do not reflect the problem nature appropriately in a number 
of applications. Often, there are either start-up costs or fixed costs involved in production planning 
[GP90]. Consider for instance the problem of leasing communication lines in a network. A fixed 
charge occurs for each link that is rented, no matter how much trafiic actually passes through this 
link. 

In this paper we investigate such network flow problems with fixed costs, where a cost arises for each 
edge with nonzero flow and which does not depend on the amount of flow on that edge. We also 
investigate flow problems where a budget can be used to increase capacities in the network. Here, 
the goal is to improve the network such that the maximum flow with respect to the new capacities 
is maximized. 

2. Problem Definition and Preliminaries 

Unless otherwise stated, by G = (V, E) we denote a directed graph with node set V and edge set E. 
We write n := \V\ and m := \E\. By s and t, we denote two distinguished nodes, the source and the 
sink, respectively. 

Definition 1 (MaxFlowImp) Let G = (V, E) be a graph. For each edge e e E, let Ue > 0 be its 
capacity, Ue > Ue its maximum capacity, and be be the cost of increasing its capacity be one unit. A 
valid improvement is a function i: E Q>o such that Ue + i{e) < Ue for each edge e, and the total 
cost T,eeE *5 boundcd by budget value B. Find a valid improvement i, such that the flow from s 
to t in the graph with capacities given by Ue + i{e) for edges e is maximized. 

We further distinguish the problem MaxFlowImp according to the valid values of the improvement 
strategy. For Continuous-MaxFlowImp, improvement i{e) can take any rational number within 
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the capacity constraints. For Integer-MaxFlowImp, the improvement is restricted to integer 
numbers. Finally, for 0/1-MaxFlowImp, improvement i{e) is restricted to the two values 0 and 

Ue — Ue. 

It turns out that Continuous-MaxFlowImp and even Integer-MaxFlowImp can be solved in 
polynomial time, while 0/1-MaxFlowImp is NP-hard. Furthermore, we will show the equivalence 
between 0/1-MaxFlowImp and the following problem: 

Definition 2 (MaxFlowFixedCost) Given a graph G = (V, E) with nonnegative capacities Ue 
and nonnegative costs Cg for the edges e £ E, find an edge subset AC E of cost Z)eeA ^ B, such 
that in (V, A) the flow from the source s to the sink t is maximized. 

This problem is a bicriteria optimization problem where the cost is constrained and the flow is 
maximized under that constraint. In this context a generic bicriteria maximization problem II = 
(/, p, r) on a weighted graph is deflned by specifying two polynomial time computable objectives, / 
and g and a membership requirement in a class of weighted subgraphs F (not necessarily weighted 
exactly the same way as the original graph). An instance of the problem specifies a budget value B 
as upper bound for the objective g. The goal is to And a subgraph from the set { x G F : p(x) < 
5, i = 2, . . . , /c } having maximum possible value for /. 

Definition 3 A polynomial time algorithm for a bicriteria maximization problem II = (/, g, F) is 
said to have performance if it has the following property: For any instance ofU the algorithm 

1. either produces a solution x from the subgraph class F for which the value of objective g is 

OPT* 

at most times the specified budget and where OPT is the minimum value of a 

solution from F that satisfies the budget constraint, or 

2. correctly provides the information that there is no subgraph from F which satisfies the budget 
constraint on g. 

An approximation algorithm with performance (a, 1) is usually also said to have performance a and 
referred to as an a- approximation algorithm. 

The dual problem II' to the maximization problem II = (/,p, F) is a minimization problem ob- 
tained by exchanging the two criteriae. The problem II' consists of finding a subgraph from the 
set { X G F : /(x) > 5' } having minimum possible value for g. In our particular case, the dual of 
MaxFlowFixedCost is defined as follows: 

Definition 4 (MinCostFixedFlow) Given a graph G = (V, E) with nonnegative capacities Ug 
and nonnegative costs Cg for the edges e £ E, find a minimum cost subset AC E of the edges of G 
such that in (V, A) the flow from the source s to the sink t is at least F. 

The definition of an (a, /5)-approximation algorithm for the minimization problem II' is similar to 
the one given in Definition 3. As shown in [MR’‘'95, Kru96] there is a tight relation between the 
complexity of a bicriteria problem II and its dual II': The problem II is solvable in polynomial time 
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if and only if II* is. Moreover, the problems are closely related with respect to their approximability. 
We refer to [RM'‘‘93, MR'^OS, Kru96] for a more detailed treatment of bicriteria problems. 

We show that MinCostFixedFlow is NP-hard even on series-parallel graphs. On the other hand, 
we provide a FPAS for that graph class. For general graphs, we provide a simple algorithm with 
approximation factor F, where F is the amount of flow to achieve. 

3. Polynomial Time Solvable Flow Improvement 

The problem Continuous-MaxFlowImp can be solved optimally in polynomial time by the fol- 
lowing easy extension of the well known Linear Programming formulation for the maximum flow 
problem. 



maximize F 



subject to 



Y^(v,w )eE f{v, W) - E{„,„)6E /(«> v) 



F if V = s 

0 for all V € \ {s, t} 

—F ifv = t 



0 < /(e) < Ue 4- i(e) for all e € F 
0 < Ue + i{e) < Ue for all e € F 
^(e)6e ^ F. 

Our main result of this section is to show that MaxFlowImp can also be solved efficiently if we 
require the improvement strategy to be integral. 



Theorem 5 Integer- MaxFlowImp can be solved optimally in polynomial time by 0{\og{nU)) 
minimum cost flow computations on a graph with 2m edges, where 

U := max{ C/g : e € F } (1) 

is the maximum capacity occurring in the input. 



Proof: The crux of the proof is to show that Integer- MaxFlowImp can be transformed in 
polynomial time into a budget-constrained minimum cost flow problem. 

Let i* be an optimal improvement strategy and f* be a corresponding maximal flow. For each 
edge e, since be is nonnegative, we have i*{e) = max{0, /*(e) — Wg}; otherwise the strategy would 
waste money. 

We can model this behavior by a flow cost function Cg deflned as follows. As long as one sends flow 
along an edge e of value at most Ug, there are no costs. But to send more flow along this edge, one 
has to pay be units of money for each unit of flow exceeding the old capacity Wg. This results in the 
following piecewise-linear flow cost function Cg (e G F): 



Ce(/) = 



0 for 0 </ < Ug 

be' if - Ue) for Ue <f < Fg 



( 2 ) 



Although the cost functions Cg are not linear, they have the nice property to be convex. We will 
show in the sequel how to exploit this fact. 
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But first note that our MaxFlowImp problem is equivalent to finding a maximum flow of cost at 
most B in the graph with upper capacities Ue (e G E) and nonlinear flow cost functions Cg as defined 
above. We call this equivalent problem a budget- constrained minimum cost flow problem. 

Our next goal is to relinearize the cost functions. To achieve this goal we use a method described in 
[AM093]. Each edge e G E* is replaced by two parallel edges eo and e\. The capacities Ug and linear 
flow costs Cg of these edges are set as follows: 

UgQ := Ug iXgj := C/g — Ug 

^eo •” 0 ^ei •“ 

The validity of the transformation follows easily from the convexity of the flow cost functions Cg. 

Let F* be the maximum flow value achievable for a budget of B. The value F* can be determined 
by a binary search which finds the largest integer F G [0, nU] such that there exists a flow of value F 
with costs at most B. This binary search needs 0(\og{nU)) minimum cost flow computations. □ 

3.1 Approximate Solution 

Instead of performing a binary search on the interval [0,nC7] we can search the interval only in 
multiplicative steps of 1 + e, where e > 0 is a fixed accuracy parameter. More formally, we And the 
maximum value 

F' € {1, 1 + e, . . . , (1 + e)*}, where k = riog(i+£)(ni^)l . (4) 

such that there exists flow of value F' of cost at most B. The value F' found by this modifled binary 
search satisfies F' > F*/{l-\-e). We thus obtain the following theorem: 

Theorem 6 For any fixed e > 0, a {1 -\-e)- approximation for Integek-MaxFlovjImp can be found 
by 0{\og\ogi^^{nU)) minimum cost flow computations on a graph with 2m edges. □ 

3.2 Nonlinear Cost Functions 

It should be noted that the techniques presented above can be extended to the case when the original 
cost functions be given in the speciflcation of MaxFlowImp are piecewise-linear convex functions 
instead of linear functions. Then, again the cost functions Cg constructed in the first transformation 
are also convex piecewise-linear and in the final step we can replace each edge by a set of parallel 
edges (each with linear cost function) where each of the parallel edges corresponds to one piece of 
the cost function Cg. 

Thus, Theorem 5 and 6 carry over to the more general case of piecewise-linear convex cost functions be. 
However, each of the minimum cost flow computations must now be carried out on a graph with 
0{gm) edges, where g is the maximum number of breakpoints occurring in the piecewise-linear cost 
functions. 

4. Flow Improvement and its Relation to Flows with Fixed Costs 

In this section, we examine the remaining variant 0/1-MaxFlowImp of the problem under study. 
Since it is more convenient, we will deal with the dual version 0/1-MinImpFlow of the problem. 
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As pointed out in the introduction, hardness and approximation results then carry over to 0/1- 
MaxFlowImp. 

Definition 7 (0/1-MinImpFlow) Given a graph G = (V^E) with edge capacities Ue > 0, max- 
imum capacities Ue > Ug, and capacity improvement costs be, find a improvement strategy i: E 
{0, Ue — Ue} of minimum cost T,eeE * (Ue ~ Wg), such that the graph with edge capacities given by 
Cg + i(e) admits a flow of value F from s to t. 

We first show the equivalence of 0/1 -MinImpFlow to MinCostFixbdFlow. 

Theorem 8 0/1-MinImpFlow is equivalent to MinCostFixedFlow. 

Proof: Let I = (G,u,U,b,F) be an instance of 0/1-MinImpFlow. We construct an instance 
r = {G\ u\ o', F') for MinCostFixedFlow in the following way. To obtain G' from G, replace 
each edge e by two parallel edges e[ and e^. Set the capacities of the edges to u'{e[) := Wg and 
u(e 2 ) := Ue - Ue, and the costs to c'(e'i) := 0 and cf{e 2 ) := be • (Ue — Ug), respectively, and let 
F' := F. Then a solution of /' for MinCostFixedFlow with cost B implies a solution of I for 
0/1-MinImpFlow with the same cost. 

Conversely, let /' = (G', u', cf, F') be given. Set G := G', F := F', u := 0. For each edge e', set 
Ue := u'(e') and be := d(e')/u\e). Then, the equivalence is immediate. □ 

As we will show in Section 6, MinCostFixedFlow is NP-hard. Hence we can not expect to find any 
polynomial time algorithm which solves the problem to optimality. Thus in the next section we will 
concentrate on designing approximation algorithms for MinCostFixedFlow. Due to the results of 
Theorem 8, our approximation results for MinCostFixedFlow carry over to 0/1-MinImpFlow. 

5. Approximation Algorithms for Flows with Fixed Costs 

5.1 An FPAS for MinCostFixedFlow on Series-Parallel Graphs 

A family {Ag}g of approximation algorithms for a problem H is called a fully polynomial approxima- 
tion scheme or FPAS, if algorithm is a (1 + e)-approximation algorithm for II and its running 
time is polynomial in the size of the input and 1/e. 

In this section we first present a pseudo-polynomial algorithm for MinCostFixedFlow on series- 
parallel directed graphs. Then, we show how to convert this algorithm into a FPAS by scaling 
techniques similar to those given in [SK98]. 

First we recall the recursive definition of series-parallel graphs (cf. [BLW87]): 

Definition 9 (Series-Parallel Graph) The directed graph G with vertex set {a, b} and edge set 
{(a, 6)} is series-parallel with terminals a and b. If G\ = (Vi,Ei) and G2 = (V 2 ,E 2 ) are series- 
parallel graphs, with terminals ai, bi and 02 , b 2 , respectively, then 

1. The graph obtained by identifying 02 and bi is a series-parallel graph, with ai and 62 as its 
terminals. This graph is the series composition of G\ and G 2 . 
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2. The graph obtained by identifying oi and 02 and also bi and 62 is a series-parallel graph, the 
parallel composition of G\ and G 2 . This graph has ai(= 02 ) and 61 (= 62 ) as its terminals. 

In [VTL82] the authors present a linear time algorithm to decide whether a given digraph is series- 
parallel, and if this is true, produce a parse tree (or decomposition tree) specifying how G is con- 
structed using the above rules. The size of the parse tree is linear in the size of the input graph. The 
following lemma can be proven by an easy induction on the size of the parse tree of a given graph. 

Lemma 10 Let G = (V, E) be a series-parallel graph. Let s,t € V be two different vertices and 
denote by G' the graph obtained from G by removing all the vertices that are not reachable from s or 
from which one can not reach t. Then G' is series-parallel with terminals s and t. □ 

In view of Lemma 10 we will assume in the sequel without loss of generality that for the given 
series-parallel graph G the two terminals coincide with the source s and the sink t between which 
the flow is to be maximized. 

Let C = maxeef; Cg be the maximum cost of an edge in the graph and let B e [ 0 , mC] be an integral 
budget value allowed for the flxed costs of a flow. The value B will act as “guess value” for the 
optimum cost in the final algorithm. Notice that the optimum fixed cost is an integer between 0 
and mC. 

For 0 < 6 < JB we define Coib) to be the maximum flow that can be achieved by using edges of 
total cost no more than b. In our algorithm we first use the algorithm from [VTL82] to obtain a 
decomposition tree for the input graph G in time 0(n -f m). We then use dynamic programming 
and the decomposition tree to compute all the values Caib), / = 0, . . . , 5 in 0(mB^) time. 

Clearly, if G consists of just the two vertices s and t joined by an edge (s, t), we can trivially compute 
all the values Coib) in 0{B) time. On the other hand, if G is the series composition of G\ and G 2 , 
then for 6 = 0 , . . . , B we have 

Coib) = m^^mm{Ca,{i)^CG 2 {b - i)}. (5) 

Similarly, if G is the parallel composition of G\ and G 2 , then 

Caib) = Cg.W + Ca,(b - i). (6) 

Since the size of the parse tree for the series-parallel graph G was assumed to be 0(m), the dynamic 
programming algorithm using the recurrences (5) and ( 6 ) terminates in O(mB^) time having correctly 
computed all the values Coib), b = 0, ... ,B. By also keeping track of the respective edge sets we 
can also obtain the corresponding edge sets. 

Let A(G, u, c, B) be the algorithm from above that returns a set A of edges of cost c(A) at most B 
such that the flow in (V, A) with capacities given by u (restricted to A) is maximized. 

Let G, u, c and F be as specified for an instance of MinCostFixedFlow and let C again denote 
the maximum capacity. Let e > 0 be a given accuracy requirement. Now consider the following test 
for a parameter M G [1, nC]: First we scale all edge costs in the graph by the factor Me/m, i.e., we 
set 




( 7 ) 
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We then run A(G, u, c^, (l+l/e)m). We call the test successful if the algorithm gives the information 
that the flow value F can be achieved by edges of cost at most (1 + l/£:)m. Observe that running 
time of A(G, u, c^, (1 + l/€)m) is 0(m^(l H- l/e)^). 

Denote by OPT the minimum flxed cost in the original graph and let A* be a corresponding edge 
set. We now show that the test is successful if M > OPT. For such a value of M we have 

E <^i^) ^ E + 1^*1 - (8) 

eeA* eeA* ' 

Thus the edge set A* is a feasible solution for the scaled instance yielding a flow of value F. Conse- 
quently, the test will be successful. 

We now use a binary search to And the minimum integer M' € [0, mC] such that the test described 
above succeeds. Our arguments from above show that the value M' found this way satisfles M' < 
OPT. Let A' be the corresponding edge-set found by A(G,u,c^, (1 + l/e)m) which yields a flow of 
value at least F. Then 

E Ce < ^ E cf' < ^(1 + 1/e)”* < (1 + e)OPT. (9) 

Thus, the edge set A' found by our algorithm has cost at most 1 + e times the optimum cost. The 
running time of the algorithm can be bounded as follows: We run O(logmC') tests on scaled instances, 
each of which needs 0(m^(l + 1^)) time. Thus, the total running time is 0(m^(l H- 1/e) logmC), 
which is bounded by a polynomial in the input size and 1/e. We summarize our results in the 
following theorem: 

Theorem 11 There is a FPAS for the problem MinCostFixedFlow when restricted to series- 
parallel (directed) graphs. □ 

By a similar proof, one can obtain the following result for the dual problem: 

Theorem 12 There is a FPAS for the problems MaxFlowFixedCost and (?/i-MAxFLOWlMP 
when restricted to series-parallel graphs. □ 

5.2 An Approximation for General Graphs 

In this section we present an approximation algorithm for MinCostFixedFlow on general graphs 
with performance F. This algorithm works both for the directed as well as for the undirected case. 
The algorithm works as follows: Compute an integral minimum cost flow (p of value F in the graph G 
from s to t where the capacities Ue of the edges are as given and the cost of one unit of flow over edge e 
is deflned to be Ce/ug. Let = T>eeE ^ denote the cost of the integral flow (p:E N 

computed this way. 

First notice that < OPT, that is, the flow cost is bounded above by the optimum cost of a set 
of edges that allow a flow of value F. We now define the edge set A := {e E E : (p{e) > 0 }. Then 

Ece = E«v < ^’E ^ < -FE < F-OPT. 

eeA eeA “e eeA “e eeA “c 



(10) 
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Thus, the set A has total cost at most F • OPT. It remains to show that A is also feasible. For any 
cut S(S) separating the source s and the sink t, we have 

E “e> E He)>F, (11) 

C6«5(5)n>l eeS(S)nA 

since cp is a flow of value F. We thus have the following theorem: 

Theorem 13 An approximation with performance F can be found in time 0(m + where 

TmcF needed to compute a minimum cost flow of value F in the input graph with 

capacities Ue and per unit flow costs of Ce/ue. □ 

6. Hardness Results 

We first show the hardness of MinCostFixedFlow even when restricted to the class of series- 
parallel graphs. Notice that for the decision version of the dual problem, MaxFlowFixedCost, 
the NP-hardness on general graphs is already known [GJ79, Problem ND32]. 

Theorem 14 MinCostFixedFlow is NP-hard even on series-parallel graphs. 

Proof: We show the lemma by a reduction from Knapsack. An instance of Knapsack is given 
by a finite set A = of items, each with weight w{ai) > 0 and value l{ai) > 0, and 

two integers W and L. It is NP-complete to decide whether there is a subset A' C A such that 
w{A') < W and 1{A') > L [GJ79, Problem MP9]. 

Given an instance of Knapsack, we construct a graph with vertex set {s,t} joined by \A\ parallel 
edges. For item a*, edge has cost w(ai) and capacity v{ai). We set the flow constraint to the 
minimum value L of the knapsack. 

It is easy to see that the instance of Knapsack has a solution if and only if there is a solution of 
MinCostFixedFlow by a selection of edges of cost at most W. □ 

Before stating the hardness result we recall the definition of the MinSetCover problem [GJ79, 
Problem SP5] and cite the hardness result from [Fei96] about the hardness of approximating Min- 
SetCover. 

An instance (M, F) of MinSetCover consists of a finite set M of ground elements, a family T of 
subsets of M^. The objective is to find a sub-collection C C F of minimum size \C\ which contains 
all the ground elements. 

Theorem 15 ([Fei96]) Unless NP C any ^ > o there is no approximation 

algorithm for MinSetCover with a performance of (1 - £:)ln|M|, where M is the set of ground 
elements. □ 

Theorem 16 ([AS97]) There exists a constant rj > 0 such that, unless P = NP, there is no approx- 
imation algorithm for MinSetCover with a performance ofrjhL \M\, where M is the set of ground 
elements. □ 



^Without loss of generality we assume that each element of Q belongs to at least one subset in F. 
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Theorem 17 MinCostFixedFlow is strongly NP-hard even on bipartite graphs. Unless NP C 
for any e >0 there is no approximation algorithm for MinCostFixedFlow 
on bipartite graphs with performance guarantee of {\-e) InF, where F is the given flow value to be 
achieved. 

Proof: We show the theorem by providing an approximation preserving reduction from the Min- 
SetCover problem. Given an instance of MinSetCover, we first construct the natural bipartite 
graph, one side of the partition for set nodes and the other for element nodes. We insert an edge 
(Q, if ^ 6 Q. All these edges have capacity 1 and zero costs. 

We now add a source node s and a sink node t to the graph. The source node is joined to all the set 
nodes via edges (s, Q) {Q G F). We set U(^s,q) := \M\ and C(a,g) := 1. For each element q there is an 
edge {q, t) from q to the sink with U(g^t) '•= 1 and C(g^t) •= 0* Let us denote the resulting graph by G. 
Finally, we set the flow value F to be the size \M\ of the ground set. 

Since the cost of any selection E' of edges is exactly the number of edges in E' emanating from the 
source, we can assume without loss of generality that each such set F' contains all zero-cost edges 
between sets and elements and the elements and the sink. 

It is now easy to see that the sets {Q : (s,Q) G E^} form a cover of M if and only if the flow 
in (V,E^) has value \M\. Thus, a set cover of size K transforms into a feasible solution for Min- 
CostFixedFlow of the same cost and vice versa. □ 
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Approximative Evolutionary Algorithm for the Probabilistic 
Traveling Salesman Problem 



S. Rosenow, Universitat der Bundeswehr, Hamburg 



1. Introduction 

The formal structure of the probabilistic traveling salesman problem (PTSP) is a generalization of 
the traveling salesman problem (TSP). The basis of the PTSP is a TSP with stochastic customers 
(TSPCS), so that the traveling salesman has to visit city i in a tour only with probability pi — 
corresponding to the actual demand of the customers. 

The PTSP should not be confounded with the TSPCS. The TSPCS is the specification of the problem, 
while the PTSP is an algorithmic approach. It was first described by Jaillet [1985]. There are 
two components: the a priori solution and the updating method. The a priori solution combined 
with the updating-method gives an approximate solution for every instance of the TSPCS. Here we 
will consider the most frequently discussed case: the a priori solution is a tour, which minimizes the 
expected length over all possible instances and the updating method for the traveling salesman is to 
delete the customers with no demand, and to visit the remaining cities in order of the a priori tour. 
To find the optimal solution for the PTSP is of the same complexity as the deterministic TSP. Here 
for the PTSP a branch-and-bound approach similar to the known algorithms for the TSP will be 
developed. Also a heuristic based on evolutionary methods will be presented as an example for an 
approximate solution. 

2. The Model 

We will specify the model as described in Rosenow [1997]. The coordinates of the cities 1, . . . , n are 
Euclidean. Random variables Xi £ {0, 1}, z € / = {1, . . . ,n}, are assigned to the nodes, designating 
a necessary visit of city i with the corresponding probability pi, Xi ^ B{l,Pi). A subset SCI 
of k nodes (0 < A: < n, |5| = A:) results as a realisation of the random variables Xi. The Xi are 
independently distributed, and there are two types of nodes. The nodes are divided into rii “black 
nodes”, which are always present, pi := 1 (type 1 nodes), and ri 2 “white nodes”, which have to be 
included in the tour with a probability p* < 1 (type 2 nodes). Probabilities are individually assigned 
to the single “white nodes”: i € / : p, ^ Pj, Vi ^ in general. 

A tour will be represented by a path / = (1, . . . , ij, . . . , 1), ij £ I \ {1}, ij ^ ik for i^k. To avoid a 
duplication of a tour, e.g. a rotation of (1, 2, 3, 4, 5, 6, 7, 8, 1) i.e. (2, 3, 4, 5, 6, 7, 8, 1, 2), the first 
city of the tour will be fixed as city 1. We assume that the number of black nodes are 1 or higher, 
so without restriction node 1 is a black node. 




The quality of an a priori solution will be evaluated by the expected length of a possible a priori tour. 
As known the expected length E[Lf] of a tour / can be calculated in 0{n^) time by the following 
formula: Let the optimal a priori tour be / = (1, 2, . . . , n, 1), then 

^jPiPi n (1 - p*) + s L <^ijPiPi n (1 - p*) 11(1 - pi). ( 1 ) 

t=l j=i+l fc=i+l 1=1 j=l k=i+l l=l 

E[Lf] refers to the lengths of the single distances dij between two nodes i and j. The traveling 
salesman goes directly from i to j, using the distance dij^ while i and j are present in the tour 
with the product probability pi • pj, and the nodes k between i and j in the tour are absent with 
the probability {1 — Pk). In the first term {i < j) we multiply by (1 — Pk) for those k in between 
i < k < j, in the second term {i > j) hy {1 - Pk) and (1 - P/) for those k and I of the complement, 
A: = z + 1, . . . ,n and / = 1, . . . , j — 1. 

3. A Branch-and-Bound Algorithm to solve the PTSP 

The branching is based on the specific tour construction procedure. In the first step we branch into 
n - 1 subproblems, one subproblem for every possibility to choose a node for /(2), the second city 
in the tour. A lower bound will be calculated for every combination of that branching. The node 
with the lowest bound will determine the next branching. So when reaching the first solution for an 
a priori tour, there are (n - 1)! boundings to calculate. 

The lower bound of such a subproblem are simple calculations. Corresponding to the branching the 
bounding will be a step by step calculation of E[Lf] of formula (1). In the first step of the branching 
/(2) is determined. From E[Lf] a lower bound B2{f{l),f{2)) for the PTSP can be established 

^2(/(l), /(2)) = C^/(l)/(2)P/(l)P/(2)(l + df(2)f(l)Pf(l)Pf(2) 11(1 ~ Pk)). 

Hf 

^ 2 (/( 1 ),/( 2 )) contains that part of formula (1) which refers to the distances d/(i)/( 2 ) and d/( 2 )/(i), 
while /(2) is in the tour direct after /(I) and the nodes k^,..., kio must be in / between /(2) and 
the endpoint /(I). Hence to go directly from f{2) to /(I) is only possible if the nodes /cs, • • • , A;io are 
absent. In step m by including an arbitrary edge (/(m - 1), /(m)) to the tour, the lower bound will 
change as follows: 

m /(m) 

Bm+l{fm+l) = Bmifm) + «^/(t)./('"+l)P/(i)P/(m+l) H “ P*) + 

•=1 *=/(<+!) 



m /(*“!) 

D^/«./(m+l)P/(m+l)P/(i) n (1 -P*) 11(1 “Pj) 
i=l k=f(l) its 

where /„ = (1, /(2), . . . ,/(m)). B^Um) is the actual lower bound for /„. In the m-th step we 
add two terms to Bmifm), both terms refer to the distances between the new node f{m + 1) and 
all nodes which actually belong to the tour fm- in the first term we look at every distance from 
a node f{i) € fm to f(m + 1) and multiply the probabilities PiP/(m) and (1 - pk) for the nodes 
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A: € (/(z + 1), f{m)). In the second term the distances are used from /(m -f 1) to all nodes of fm, 
where we multiply the probabilities Pf{m)Pi and {1 - pj) for the nodes j ^ and (1 - p*;) for the 
nodes k € (/(I), f{i - 1)). 

For illustration consider figure 1 where a seventh node /(7) is added to the tour /. 



f(3) f(4) 




In the last step of the calculation, when / is a possible a priori tour through all nodes i 6 /, the 
bound attains the expected total length: 



B^{f)=E[L,]. 

A better lower bound can be calculated by adding a reduced minimum spanning tree (MST) bound 
BMsr{i ■ ^ ^ fm) for the nodes z, which are in step m — 1 not in the tour fm- There is the well 
known relationship between the length of a tour built by the minimum spanning tree [Li^mst)) and 
the length of the traveling salesman tour Ltsp in the deterministic case: 

Wmst) ^ 2, 

Ltsp ~ 

However, there is no simple relationship between the expected length of an a priori tour formed by a 
minimum spanning tree algorithm {E[Lt{MST)]) and the expected length of the optimal a priori tour 
[ E [ LpTSp ])’ Adapted from Bertsimas [1988] there is: 

E[Lt{MST)] ^ 2 

E[LpTSp] ~ minpl' 

For example for min{pi) = 0.1, the bound will be 200.So we will only calculate a MST regarding the 
distance matrix dij • Pi • Pj for the nodes z, j ^ fm starting at /(m): 



EMSrii • * ^ fm) = ^di^mst{t)PiPmst{t) 

iif 



with mst{i) as predecessor of i in the tree. 

For all tours t based on fm the expected length can be estimated as follows: 



^ Em{fm) ^ Em{fm) ^t{i)ymst{t{i))Pt{i)Pmst{t{i)) 

tmf tmf 
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Numerical results show that the additional calculation of a minimum spanning tree is advantageous 
for PTSP, which are close to the TSP. Closeness e.g. is given by problems with a large number 
of black nodes or with probabilities well above 0.5. For all other cases the bounding Bm{fm) will 
determine a faster branch- and-bound algorithm. 

4. An evolutionary algorithm to solve the PTSP 

The branch-and-bound-approach needs clearly non-polynomial time to solve the PTSP. Even if we 
use a 15 node example by simple calulation one could establish computing times which are in the 
order of weeks. Obviously this is absurd, no one can use such a strategy. In the last years heuristics 
for optimization problems, in particular in the area of genetic algorithms, have been improved vastly. 
Also we will look at evolutionary programs to solve the PTSP. 

For an efficient evolutionary program we had to find a genetic representation for the solutions of the 
problem. We use feasible a priori tours. The representation of a tour will be the path representation, 
which is described above. For the initial population we will discuss two successful cases. In the 
first case all individuals of the initial population will be randomly chosen tours. In the second 
10 — 25% of the individuals will be generated by simple greedy algorithms for the TSP (as a possible 
approximation for the PTSP) or the PTSP (the algorithm based on the same idea as the bounding 
Bmifm))- Obviously we have to check duplicate individuals, which have to be deleted. The rest 
of the population will be filled up by random tours. It is beneficial to choose some individuals 
randomly, so that the evolutionary algorithm can create possible solutions from the whole space of 
feasible solutions. Some experiments show that the solutions will be better than the solutions of 
starting-generations which are all generated by heuristics for the PTSP or TSP. 

The evaluation function will be E[Lf] (equation (1) above). The operator to generate the offspring 
of two randomly chosen parent tours will be a heuristic crossover. There will be no additional 
mutation of the individuals. As shown for the TSP by Grefenstette [1985], we can make an 
experimental proof that crossovers which do not use further information of the PTSP yield poor 
results. Grefenstette proposed for the TSP to take into consideration the length of the edges leaving 
a city in the parent tours. For the offspring he chooses the shorter one, and in the case that the 
including of that shorter edge introduces a cycle, he extends the tour by a randomly chosen edge. 
In such an approach a lot of random choices will be made. A better approach is the look at all 
edges of one city, leaving and reaching the city in one of the parent tours. Following Whitley, 
Starkweather and Fuquay [1989] we will create an edge list for every city i. The list consists 
of those cities which are connected to city i in at least one of the parents. To avoid too many random 
choices we start with the city that has the smallest number of entries in the edge list. In the same 
way the tour will be built up: from the list of the last included node in the tour we choose that city 
which now has the smallest number of cities in its edge list. The chosen cities are deleted from the 
remaining edge lists. In the case of more than one city with the same number of cities in the list, 
Whitley, Starkweather and Fuquay make a random choice. Except the first choice we will take that 
city to be the next node in the tour, which has the lowest bound Bm{fm)- In the case that there 
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is no city left in the edge list, the next city of the offspring tour will be determined as there are all 
cities in the list which are not included in the tour at that time. 

By this heuristic crossover we generate an offspring of half the size of the parent generation. These 
offspring together with 50% of the parent generation will create the new parent generation for the 
next run of that genetic operator. 

Finally the exact solution method will be compared with an approximate evolutionary algorithm. 
The main result is that the exact solution can be approximated closely and fast by our heuristic 
algorithm. 

5. Numerical results 

The described algorithms were tested on problems with randomly generated cities. The coordinates of 
the cities are uniformly distributed. The number of cities range from 10 to 14 in the first experiment, 
where we will compare the branch-and-bound-algorithm (BAB) and the two evolutionary programs 
EPl (with randomly generated initial population) and EP2 (with solutions of greedy heuristics in 
the first parent generation). To see the performance of an algorithm it is not necessary to work with 
large numbers of cities. The number of black nodes will be 1, the probabilities are uniform random 
numbers, 0 < Pi < 1. 

The population size will be fixed at 100. The number of greedy-solutions in the starting population 
of EP2 will be 2 X number of the cities minus the duplicates. 

After 10 runs of the genetic operators there are tours which present the exact solution among the 
offspring tours, between 20 and 50 runs the algorithm converges to one solution tour. 

The number of edges which are not from one of the parent tours is very small. For 10-city-problems 
there are about 0-2 such edges in 10 runs, for 14-cities-problems the range is from 4 to 20 edges in 
10 runs, and in every run there are 50 offspring tours created. Computing time for the evolutionary 
programs depends obviously polynomially on the number of cities and the corresponding population 
size. For our 10- 14-city-problems the running time for one problem is lower than 10 secs (for our 
small machine, Apple Power Macintosh G3). Table 1 shows the results for 100 problems of each 
problem size. Both algorithms, EPl and EP2, complement each other. Usually a failing solution of 
one generally is solved by the other algorithm. In that cases in which EP2 do not reach the exact 
solution, the greedy initial tours directs after a few runs the algorithm to a local optimum. 





n = 10 


n = 11 


n = 12 


n = 13 


n = 14 


EPl 


100 


98 


97 


95 " 


95 


EP2 


100 


99 


97 


97 


94 



Table 1: Number of problems out of 100, EP1/EP2 results in the exact solution, each different solution less than 1% 
difference from our optimum 

For problems of more than 15 nodes there are no exact results because they take too much time for 
my research. There does not exist a PTSP-library with numbers of exact and heuristic solutions of 
bigger problems that are results of other researchers. So I present the results of some problems of 




173 



40 to 50 nodes in table 2 to demonstrate the EPl for the PTSP. The problems are generated in the 
same manner as in the first experiment. The population size is between 200 and 300. The computing 
time varies from 5 to 8 minutes. 



k 


best tour 


worst tour 


10 


0,64 


0,88 


50 


0,36 


0,55 


100 


0,30 


0,30 



Table 2: Average percentage rate between a tour of the k-th generation and the best tour of the initial population 

Finally, to demonstrate the quality of the algorithm EPl we will consider the well known quadratic 
49-cities-problem shown in figure 2(a) in the special case of the TSP (all nodes are black nodes): 




Figure 2(a): A 49-cities-problem 

The population size is 250, and all initial tours are randomly chosen. In figure 2(b) we see the best 
of these initial tours of length 152.92. In figure 2(c) the best tour after 50 runs is shown. The length 
of that tour is 60.12, which is not optimum (see the crossing and the dead alleys). The worst case of 
a tour length in that generation is 103.17, well below the starting length of 152.92. 




Figure 2(b): best tour of the first population 



Figure 2(c): best tour of the 50th generation 
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Figure 2(d) finally shows the result of the EPl after 120 runs of length of 49.41. The algorithm 
converges to two solutions, the second is a tour with a length of 50.24. 




Figure 2(d): solution tour at the 120th generation 
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A Genetic Algorithm with a Non-Binary Representation 
for the Set Covering Problem 

Anton V. Eremeev, Omsk Branch of Sobolev Institute of Mathematics, RAS 

Summary: A genetic algorithm (GA) for the set covering problem is developed using a non-binary 
solution representation. The paper describes the GA and the new problem-specific operators of 
crossover and local improvement that we have implemented in it. Usage of these operators leads 
to a substantial improvement of the algorithm. Finally, some promising results of computation 
experiment and comparison to other algorithms are reported. 

1. Introduction 

The set covering problem (SCP) can be stated as follows. Consider a set M = ,m} and the 

subsets Mj C M, where j e N = n}. A subset J C N is a cover of M if (J Mj = M. For 

jeJ 

each Mj, a positive cost Cj is assigned. The SCP is to find a cover of minimum summary cost. 
Sometimes it is more convenient to consider SCP as an integer linear programming problem: 

min{ca; : Ax > e,x £ {0, 1}”}, (1) 

where A is an m x n matrix of Os and Is, c is an n- vector of costs, e is the m- vector of Is, and = 1, 
if and only if 2 € Mj. Using this formulation one can regard SCP as a problem of optimal covering of 
rows by the columns in matrix A. In this paper we assume that the columns are ordered according 
to nondecreasing values of cj-s and if two columns have equal costs then the one which covers more 
rows stands first. 

Different algorithms have been proposed for the exact solving of SCP using branch and bound, cutting 
planes [1], L-class enumeration [9,11] and other techniques. However, SCP is an A/'P-hard problem 
and usually application of exact algorithms to the large-scale instances is very time-consuming. 

A number of heuristic algorithms are developed for approximate solving the problem within relatively 
short running time. The solutions of good quality may be obtained using Lagrangian relaxation 
heuristics [4], neural networks [10] or metaheuristics such as GA [3]. 

2. Genetic Algorithm 

GA is a probabilistic search method based on biological analogy of selective breeding in nature, 
used for the search of near-optimal solutions. The basic components of GA are a population of 
individuals and random operators that are introduced to model mutation and crossover in nature. 
An individual is a pair of genotype g and phenotype x{g) corresponding to a search point in the space 
of solutions D to a given optimization problem. Here p is a fixed length string of symbols (called 
genes) from some alphabet A. The function x{g) maps g to its phenotype x{g) e D, thus defining 
a representation of solutions in GA. The search is guided by the results of evaluating the fitness 
function ^{g) = (j)(f{x{g))) for each genotype in population (here f : D R is the goal function, 
and (j) : R is a. monotone function which is often introduced to intensify the GA search). 
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The genotypes of the initial population are randomly generated according to some a priori defined 
probability distribution. The GAs discussed in this paper implement the steady-state reproductive 
strategy [6], in which a single new individual is generated at each iteration. The genotype of a new 
individual replaces the least fit one in the population, thus the population size s remains constant 
during the run of GA. Usually the stopping criterion is the limit on the number of iterations tmax- 
The best solution found is returned as the final result when GA stops. 

The construction of a new genotype (offspring) starts with choosing a pair of parents from the 
current population with a help of probabilistic selection operator. Then the crossover operator 
replaces some genes of one parent with genes of the other. The resulting string undergoes mutation, 
where a randomly chosen subset of genes is substituted with random symbols from A. In this paper 
we consider a GA where a gene is mutated with a fixed probability Pm- The settings of pm^ s and the 
probability of crossover usage are very important for efficient work of GA. Usually they are chosen 
empirically for the specific class of optimization problems. In many successful implementations the 
genotype is processed by a heuristic of local improvement or by a repair operator which makes the 
solution feasible before it is added to a population. 

An efficient GA for SCP was proposed by Beasley and Chu [3]. The algorithm uses a convenient 
binary representation of solutions as n-bit strings (i.e. A = {0, 1}). A column j is included into 
phenotype x{g) if and only if the bit j in genotype g equals 1. The only inconvenience with this 
representation is that one has to deal with infeasibile phenotypes that may be produced by mutation 
and crossover. A greedy-like heuristic is used in [3] to repair the infeasible solutions before adding 
them to population. In the next section we describe a GA, which uses a similar reproduction strategy 
and provides only feasible solutions due to a non-binary representation. 

2.1 General Scheme of the Non-Binary Genetic Algorithm 

Let’s denote the set of columns that cover the row i by Ni = {j e N : Oij = 1}. The non- binary GA 
(NBGA) proposed in this paper is based on a non-binary representation where the genotype consists 
of m genes , g^'^\ such that € Ni, i = 1, 2 , . . . , m. Here each gene contains a column 

index that is assigned to cover the row. 

In this representation x maps a genotype g to the phenotype x{g) G {0, 1}”, where ones correspond 
to the columns present in genes of g. Obviously, it is a feasible solution to the problem (1) and 
no repair operator is needed. However, as it was noted in [3], the implementation of GA with the 
non-binary representation requires an additional operator to eliminate the redundant columns from 
the solution after crossover and mutation. For this local improvement we use greedy heuristics that 
find approximate solutions to a corresponding reduced version Pg of the given SCP. The reduced 
problem Pg has a matrix that consists of the columns of A represented in genes of g. 

An improved genotype is added to the population only if there are no individuals with the same 
phenotype in it yet. The genotypes of the initial population are generated independently and each 
gene is uniformly distributed over the set of indices of columns covering the gene. 

Let’s consider the current population on iteration t as a vector of genotypes of its individuals 
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G — (^ 1 ) ^ 2 j • • • , 9s)- Then the fitness function on iteration t is defined as 

^t{g) = cx{gi^t)) - cx{g), 

where l{t) is the index of the individual of the largest cover cost in the population on iteration t. 
The selection operator in our GA implements the proportional selection scheme, where the probability 
to choose the A;-th individual (A: = 1, 2, . . . , s) equals 

P[9k) = ^t{9k) ^t{9i) 

\/=l 

The general scheme of the NBGA 

1. While the initial population is not complete do 

1.1. Generate a random genotype g. 

1.2. Apply the column elimination procedure Prime to g and add g to the population. 

2. For t:=l to tmax do 

2.1. Choose the parent genotypes g^, gv with the help of the proportional selection. 

2.2. Produce an offspring g from g^ and g^ using a crossover operator. 

2.3. Mutate each gene of g with probability Pm- 

2.4. Obtain genotype g', applying column elimination procedures Greedy and Dual Greedy to g. 

2.5. If there are no individuals in G with the phenotype x{g')^ then 

2.5.1. substitute the least fit genotype in G by 

2.5.2. otherwise substitute the least fit genotype in G by p. 

The operators of crossover, mutation and the local improvement procedures Pnme, Greedy and Dual 
Greedy will be described in detail in the following sections. The condition in step 2.5 is introduced to 
prevent undesirable premature convergence of the population to local optima. Thus, the population 
in this GA also works as a tabu list of tabu search heuristics. In steps 1.2 and 2.5.2 an individual is 
added without any tests because duplicate solutions are very rare in these cases. 

2.2 Crossover and Mutation 

In this section we propose a crossover operator which is specially designed for SCP. Here we refer to 
it as LP-crossover. The goal of this operator is to find the best possible combination of the given 
parent genotypes and g^, if it is possible without extensive computations. 

Let’s consider a problem of optimal crossover Poe? which is a reduced version of the initial SCP but 

with the covering subsets restricted by the set of indices N' = {j : x{gu)j = 1} U {j : x{gy)j = 1}. 

First, in LP-crossover a trivial reduction is applied to P^c. Denote 5 = {z G M : \NinN'\ = 1}. Then 

each row i e S may be covered by a single column j{i) in Poc- Let’s call Q = {j : j = j(z), z G 5} a 

set of fixed columns. For each z G \J Mj, one of the columns that cover the gene in Q is assigned. 

jeQ 

Let’s refer to these genes as fixed too. As a result of the reduction we obtain a new subproblem P,., 
that consists of the rows and columns that were not fixed during this procedure. 

Second, the dual simplex method is used to solve the linear relaxation of P,., where the Boolean 
constraints are replaced with the conditions xj > 0 for all j. If the solution x' obtained by simplex 
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method turns out to be integer, then it is used to complete the genotype g by an assignment of the 
non-fixed genes that corresponds to the solution x'. 

To avoid time-consuming computations we don’t start the simplex method if the number of rows in 
Pr exceeds a certain threshold /x (we use g. = 150). If this is the case or if the number of simplex 
iterations exceeds its limit (equal to 300 in our experiments), or if the solution x' is fractional, then 
LP- crossover returns an unchanged genotype g^. 

We designed the mutation operator similar to the proportional selection procedure. Suppose, i-th 
gene is to be mutated. Then the probability to assign a column j € Ni to it equals 




2.3 Redundant Columns Elimination 

We use three greedy-type heuristics to exclude the redundant columns from the solution. 

The most simple and fast heuristic starts with a given cover and discards the columns in decreasing 
order of indices. A column is discarded only if the remaining solution is still a cover. We refer to 
this algorithm as Prime. It finds a prime cover, i.e. a solution without redundant columns. 

The second heuristic is the well-known Greedy algorithm [5]. It sequentially builds a cover to the 
problem Pg by adding at every step a column maximizing the ratio Wj/cj^ where wj is the number of 
unsatisfied inequalities that use the variable Xj. Ties are broken lexicographically (i.e. the variable 
with the smallest index is picked first). This algorithm might find a solution which is not minimal, 
therefore Prime is run after it to eliminate the redundant columns. 

The next heuristic we shall call the Dual Greedy algorithm. It combines the successive columns 
discarding of Prime and the adaptive columns pricing similar to Greedy. Let’s denote the set of 
columns in the subproblem Pg by AT' := {j^N: x{g)j = 1}. A cover J is obtained as follows. 

The Dual Greedy Algorithm 

1. Set N[ := Ni fl iV' for all i = 1, ..., m, and Mj := Mj for all j € TV', M' := M, J' := N\ J := 0. 

2. While J' ^ 0 do 

2.1. If there is i G M' such that [AJI = 1, then 

using the j G TVJ, set J := J U {j}, M' := 

Otherwise choose j G J' such that > Ck\M\^\~^ for all k G J'. 

2.2. Set J' := 

In step 1.2 of NBGA we use the Prime algorithm. The choice is motivated by our computation 
experiments, which show that the usage of more complex algorithms like Greedy and Dual Greedy 
is not justified at this stage. Firstly, they are too time-consuming compared to Prime, secondly, for 
some problems they construct the initial populations with individuals of a similar structure which is 
locally optimal. This bias might hinder the GA from attaining the global optimum. 

Greedy and Dual Greedy are used in step 2.4 of NBGA. Both heuristics are called and the best of the 
two solutions x' is accepted to construct a new genotype p', which differs from g in the genes that 
represent the columns absent in x'. These genes are switched in p' to other columns that are still in x'. 
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3. Computation Experience 

The NBGA was implemented in Borland Pascal on 100 MHz Pentium and tested on the OR-Library 
[2] benchmark problem sets A-H^ and a set of combinatorial problems CLR. The sets 4-6 and 
i4-^ consist of non-unicost problems with Cj taken from [1,100]. Before solving these problems we 
apply a core-refining reduction which is similar to the approach used in [3]. The reduction keeps 

m 

only the columns belonging to the set A^core = U • Here a\ is the set of the least r indices in 

t=i 

= 1, . . . ,m. In our experiments this reduction permitted the NBGA at least once out of 10 
trials to find the optimal solutions to all problems of A-D and the best-known solutions to the 
problems in E-H except for HI and H2. Using the reduction with r = 5, the NBGA attained the 
best-known solution to the problem H2 also. The dimensions of problems before and after the core- 
refining reduction as well as their density (i.e. the percentage of ones in the original SCP matrix), 
and the number of problems in set are given in Table 1. 



Name 


4 


5 


6 


A 


B 


C 


D 


E 


F 


G 


H 


Rows (m) 


200 


200 


200 


300 


300 


400 


400 


500 


500 


1000 


1000 


Columns (n) 


1000 


2000 


1000 


3000 


3000 


4000 


4000 


5000 


5000 


10000 


10000 


Rows in core 


640-696 


628-672 


293-315 


694-724 


312-329 


720-737 


328-341 


171-180 


91-96 


808 - 817 


353-370 


Density (%) 


2 


2 


5 


2 


5 


2 


5 


10 


20 


2 


5 


Num. of probl. 


10 


10 


5 


5 


5 


5 


5 


5 


5 


5 


5 



Table 1. 



Tables 2 and 3 compare the performance of the NBGA to the performance of GA by Beasley and Chu 
[3] (BC), and the Lagrangian heuristic by Caprara, Fischetti and Toth [4] (CFT). In our experiments 
10 trials of GA were made for each problem with the settings s = 100, pm = 0.1, tmax = 10000. The 
column a represents the average percentage deviation • 100%, where Zk is the solution 

found in A;-th trial and z* is the optimal or the best-known value. Let f^at be the frequency of 
obtaining the best solution found by a GA in a series of 10 trials. The average time Taoi spent until 
the final solution occurred, the average time of obtaining the best solution found in 10 trials Tbat, 
and the average execution time T^x are given in lOOMHz Pentium seconds. The results obtained by 
other authors are converted by estimating the performance of the computers using Dongarra report 
[7]. In Table 2 the mentioned above values are averaged over all instances of problem set for the sets 
4-6 and A-D. 

The algorithm CFT was run once for each problem [4], thus f^at and a are not available. The optimal 
or the best-known solutions to all problems in and A-H are found by it. 



Name 


CFT 


BC 


NBGA 




EM 


Tex 


■M 


- 


EW 


Tex 


B9 






EM 


T„ 


4 




2212.4 






44.95 


315.06 




^^3 


031 


E1E 


10^1 


5 




2212.4 


iilr/il 






550.80 






031 


M¥km 


OB3B 


6 


Qyi 


2212.4 






25.23 


319.10 






ESQ9 


K£El 




A 


10^3 


2212.4 


1X9 












031 


009! 


1693.0 


B 


IQQI 






Em 


68.63 






lOII 


Oilil 


OSilil 




C 








B9 


87.93 


EESSQI 


itm 


003 


^39! 


lOQi 


721.3 




7.64 


2212.4 


0.96 


0.06 


101.70 


1372.63 


0.94 


0.08 


26.62 


23.33 


582.7 



Table 2. 



The optimal solutions to all problems 'm 4-6 and A-D except 5.3 are obtained by BC at least once in 
10 trials. The current best-known solutions for G2, HI, H2 are better then those found by BC in [3]. 
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Table 3 gives more detailed information on experiments with the sets E-H. Here BK is the best-known 
solution value. The lower bound found by solving SCP linear relaxation is denoted by LP. 



Name 


BK 


LP 


CFT 


BC 


NBGA 








Best 


'^hat 


Tex 


Best 




a 


Taol 


Tex 


Best 


E9I 


<7 


That 


Taol 


Tex 


El 


29 


21.3 


29 


11.5 


2212 


29 


1.0 


0.0 


16.9 


3014.1 


29 


1.0 


0.0 


1.0 


1.0 


562.6 


E2 


30 


22.4 


30 


180.5 


2212 


30 


0.4 


2.0 


266.9 


3101.5 


30 


1.0 


0.0 


94.8 


94.8 


1768.4 


E3 


27 


20.5 


27 


41.7 


2212 


27 


0.3 


2.6 


85.1 


3110.4 


27 


1.0 


0.0 


23.1 


23.1 


1345.3 


E4 


28 


21.4 


28 


11.6 


2212 


28 


1.0 


0.0 


238.5 


3093.5 


28 


1.0 


0.0 


11.0 


11.0 


1234.4 


E5 


28 


21.3 


28 


16.2 


2212 


28 


1.0 


0.0 


15.5 


3122.4 


28 


1.0 


0.0 


2.1 


2.1 


1067.2 


FI 


14 


8.8 


14 


14.7 


2212 


14 


1.0 


0.0 


33.8 


5026.0 


14 


1.0 


0.0 


7.9 


7.9 


1698.6 


F2 


15 


10.0 


16 


13.8 


2212 


15 


1.0 


0.0 


34.5 


5535.5 


15 


1.0 


0.0 


1.3 


1.3 


1548.5 


F3 


14 


9.5 


14 


110.0 


2212 


14 


1.0 


0.0 


117.9 


5318.1 


14 


1.0 


0.0 


55.4 


55.4 


1765.1 


F4 


14 


8.5 


14 


13.7 


2212 


14 


1.0 


0.0 


92.6 


5710.3 


14 


1.0 


0.0 


20.4 


20.4 


5686.0 


F5 


13 


7.8 


13 


89.0 


2212 


13 


0.3 


5.4 


67.1 


5760.0 


13 


0.3 


5.4 


497.4 


151.2 


8290.3 


G1 


176 


159.9 


176 


65.0 


2212 


176 


0.2 


1.0 


451.3 


3222.0 


176 


0.7 


0.3 


115.0 


96.0 


795.0 


G2 


154 


142.1 


154 


346.6 


2212 


155 


0.5 


1.5 


159.3 


3628.0 


154 


0.5 


0.65 


318.3 


226.6 


836.6 


G3 


166 


148.3 


166 


432.7 


2212 


166 


0.1 


1.1 


312.1 


3151.5 


166 


0.1 


0.8 


627.6 


319.1 


972.5 


G4 


168 


148.9 


168 


167.5 


2212 


168 


0.2 


1.4 


665.4 


3503.9 


168 


0.4 


0.7 


160.0 


172.5 


884.8 


G5 


168 


148.2 


168 


105.0 


2212 


168 


0.2 


0.8 


242.6 


3696.4 


168 


0.7 


0.2 


161.2 


170.4 


952.2 


HI 


63 


48.1 


63 


642.1 


2212 


64 


1.0 


1.6 


743.0 


5383.4 


64 


1.0 


0.0 


90.5 


90.5 


1573.7 


H2 


63 


48.7 


63 


392.5 


2212 


64 


1.0 


1.6 


234.3 


5750.0 


64 


1.0 


0.0 


34.7 


34.7 


9649.5 


H3 


59 


45.2 


59 


690.4 


2212 


59 


0.9 


0.2 


796.6 


5703.6 


59 


1.0 


0.0 


493.2 


493.2 


5619.8 


H4 


58 


44.0 


58 


105.1 


2212 


58 


0.4 


1.6 


962.9 


5534.9 


58 


1.0 


0.0 


218.2 


218.2 


3458.3 


H5 


55 


42.4 


55 


68.8 


2212 


55 


0.9 


0.2 


198.6 


5610.1 


55 


1.0 


0.0 


25.2 


25.2 


1049.4 



Table 3. 



We have also tested the NBGA with uniform crossover (where each gene of the offspring is copied 
from one of the parents with equal probabilities) and with more simple local improvement schemes, 
but these versions turned out to be less efficient. 

The NBGA was also tested on a set of combinatorial problems CLR derived from a question of 
P.Erdos: ’’what is md{r), the minimal number of r -tuples that can be chosen so that every coloring 
of d elements with 2 colors contains at least one monochromatic r -tuple?” (here d>2r- 1)[8]. This 
problem can be stated as a unicost SCP [10], where M is the set of colorings of d elements (due to a 
symmetry only a half of colorings needs to be considered), N is the set of all r-tuples. Assume that 
an r-tuple covers the colorings for which it is monochromatic. We consider the case r = 4. Table 4 
shows the dimensions of SCP instances, the best-known upper bounds on md(4) from literature and 
the values of the solutions found by NBGA. The columns fbsu TtsuTgoi have the same meaning as in 
Table 3. Here we again use s = 100, tmax = 10000. The values of Pm are indicated in column pm- 



d II Rows (m) 


Columns (n) 


Best-known 


NBGA 


fbst 


That 


Taol 


Pm 


9 


255 


126 


26 


26 


1.6 


15.32 


15.32 


0.05 


10 


511 


210 


25 


25 


1.0 


27.0 


27.0 


0.05 


11 


1023 


330 


23 


23 


1.0 


274.6 


274.6 


0.05 


12 


2047 


495 


26 


23 


0.2 


1067.0 


979.7 


0.02 


13 


4095 


715 


26 


23 


0.3 


4000.4 


4615.4 


0.01 



Table 4. 



For the cases d = 12 and 13 the best solutions known in literature, are improved by NBGA, and for 
the cases d = 9, 10, 11 it finds the solutions of previously known values (see e.g.[10j). In fact, simple 
solutions of 23 tuples could be constructed for all d' > 11 without a GA. To do this one should take 
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the existing solution for d = 11 and use the same set of tuples for all d! > d (similarly it can be 
proved in general that rrid{r) is a monotone non-increasing function in d). 

According to the computation experiments on the set E the NBGA with the proposed settings 
dominates both other algorithms. The sets F and H include some problems on which NBGA is 
inferior to CFT and some problems where it is more efficient. The results for both GAs on the sets 
4-6 and A-D are analogous but the solution times are longer, than for CFT. In terms of the best 
found solutions the performance of NBGA is between those of BC and CFT. 

Further research may address the development of exact hybrid algorithms using e.g. an approach 
similar to [9]. The possibility of parallel implementation of NBGA might be considered too. 

The research was supported in part by RFBR Grant 97-01-00771, and INTAS Grant 96-0820. 
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Summary: We consider Constrained Semi-Assignment Problems, where one has to optimally 

assign discrete values to decision variables, and where furthermore logical constraints restrict the 
set of feasible assignments. We present a heuristic approach based on a continuous relaxation. 
More precisely, we consider a discrete dynamical system evolving in the interior of a polytope whose 
extremal points correspond to all possible (both feasible and infeasible) assignments. The constructed 
dynamical system combines the effects of two dynamics: The first one, being of gradient-type, has 
the task of attracting the system towards assignments with high objective value, the second dynamic 
should give a rejecting power to forbidden assignments. Local properties of the system and numerical 
experiments are discussed. 



1. Introduction 

The constrained maximization problem considered in this paper generalizes the well known pseu- 
doboolean optimization and satisfiability (SAT) problems. Instead of boolean variables we consider 
decision variables having a finite set of possible values. As in SAT, a set of logical clauses is given and 
those assignments satisfying all given clauses constitute the set of feasible solutions of our problem. 
Furthermore, each assignment is given an objective value by means of a set of weighted clauses: 
the value of an assignment is the sum of the weights of the clauses it satisfies. The constrained 
semi-assignment problem (C-SAP) consists in finding a feasible assignment of maximal value. For 
the formal definition let (i,r) stand for the predicate: value r is assigned to variable Xj. 

Constrained semi- assignment problem ( C-SAP): Given 

• a set of decision variables Xj, i G TV := {1, . . . , n} with possible values in AT := {1, 2, . . . , A:}, 

• a set of clauses 7^, where Re 71 is of the form 

R = -> {{ii,ri) A (* 2 , r- 2 ) A . . . A (i„, r„)) 

for some 2 < u < n, ij G N, rj G AT, for all j < u. For convenience, R will be used to denote 
the set R = {(ii, n), (z 2 , r 2 ), . . . , (iu, r-^)} as well. 

• a set of weighted clauses T with weights iot > 0, T G T, where T is of the form 



T = (ii, ri) A (i2, T2) A . . . A (i„, r„) 



for some u < n, ij G rj G AT, for all j < u. Again, T is identified with the set T = 
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Find an assignment of values in K to the variables satisfying all clauses of 'R. and maximizing 
^{wt\T € T,T satisfied by the assignement}. 

Note that for A; = 2 and T = 0 C-SAP reduces to the classical satisfyability problem; for k = 2 
and 71 = 0 we have a pseudo-boolean optimization (see formulation (l)-(3)). Many other combi- 
natorial problems can be stated in this form, like max A;-cut, ^-coloring and max clique. C-SAP 
contains therefore many NP-hard problems, including some for which even finding a poor approx- 
imate solution is NP-hard. Moreover, we believe that C-SAP is an adequate formalism for many 
interesting applications, like for example the label placement problem where a finite set of possible 
positions is defined for each label and additionaly to the natural non intersecting clauses for the 
labels, a valuation is introduced according to aesthetical criteria or to the ambiguities resulting from 
the placements. 

A common and often successful approach for such problems are local search (LS) heuristics, like 
Simulated Annealing or Tabu Search. However, due to their inherent myopy, such heuristics can 
hardly deal with a C-SAP of the following paradigmatic type: 

Let D C be given and i/ € jD be a unique isolated global optimum with objective value z{y) = 

M > 0. Moreover, let z{x) = 0 for all x € D\{y} (C-SAP with T := {T := {(1, yi), (2, 1 / 2 ), • • • , (?^, Vn)}} 
and 71:= |{(1, Zi), . . . , (n, ^n)}k ^ \ ■^})* ^ur goal is to present a heuristic with some ability to 

’orient itself’ even in such problems and which therefore offers an alternative approach for instances, 
where LS algorithms have problems of orientation. In fact, we will show that if M is big enough in 
the above instance our algorithm will find the global optimum (see Theorem 2.1). 

Our approach is to embed the set of feasible solutions in the continuous space and to develop a 
heuristic based on a discrete time dynamical system. The concept of our approach can be formulated 
as follows: We represent the set of (a priori) possible assignments x G as points in 

:= {p e {0 . = 1 for all i € N}, 

r=l 

where p G corresponds to the assignment Xi = r iS pir = 1. Then a C-SAP problem (with sets 7Z 
and T) can be written as 



max z(p) = Iwt 


n Pir 


(1) 


Ter \ 


Mer J 




s.t. H Pir = 0 


for all R G 7^ 


(2) 


pG A^ 




(3) 



Let now A := Conv(A^) and A® be its interior. Our approach consists in defining a mapping 
F : A® A® and to use the discrete time dynamical system resulting by iteration of F for our 
algorithm which runs essentially as follows: Choose a point p° G A® and compute the sequence 
p* := F(p*“^), 2 = 1,2, ..., t for some t. Return as ’’solution” p G A^ the assignment p closest to p^ 
The underlying philosophy is that optimal solutions of the combinatorial problem should correspond 
to attractors of the dynamical systems. The definition of F is of course central and we would be 
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already happy to have the sequence p* converging to one of those points in corresponding to a 
’good’ feasible solution for a reasonably large proportion of starting points. The list of desirable 
properties of such a dynamic includes furthermore the instability of non interpretable fixed points 
in and the escape from some neighborhood of non feasible assignments (in particular of those 
having a feasible neighbor), no matter the value of the assignment. 

The application of dynamical systems in combinatorial optimization is definitely not a popular ap- 
proach, however the works of [2] for Maxclique, [4] for Maxsat, and [10] for combinatorial problems 
reducible to proper assignment problems (i.e assignment in the sense of permutation) show that it 
has some potential to be explored (see also the book of [6]). 

In the first part of this paper we address the design of a mapping F for C-SAP and discuss some of 
its basic properties. In the second part we shall report on some numerical experiments. 

2. Algorithm 

The Algorithm consists in defining an adequate operator F : A® -> A° for a given C-SAP problem 
and to generate a sequence of points = F(p°), ... for some t and to round to a solution 
p in A^. For F we consider mappings defined by 

F{j>)ir := with Ni := (4) 

where the so-called fitness functions fir(p) > 0 are given. As we shall consider different possibilities for 
we shall use the notation F[f] for the mapping defined by (4). In the setting of multi-population 
dynamic where n populations are considered whose members can adopt one of k possible strategies, 
this kind of dynamic is referred to as ’’adjusted replicator dynamic” and describes the evolution of 
the proportion of individuals in population i adopting strategy r (see [9]). This evolution equation 
expresses the fact that the relative increase {F{p)ir -Pir)/Pir of p*r equals the relative excess fitness 
of subpopulation (z, r) over the average fitness of population i. 

We shall combine in our fitness function the effects of two dynamics: The first, being of gradient- type, 
has the task of attracting the system towards assignments with high objective values, the second dy- 
namic should care for making logically forbidden assignments repellors. Ideally, in the combined 
dynamic, the effect of the first dynamic should be stronger in the neighborhood of admissible assign- 
ments, and ’’vice versa”. For the gradient-type dynamic we choose F[F“] with 

Tir := dz/dpir = Wt n Pj> 

T:(i,r)eT U.‘)€T\{i,r) 

where z is the objective function (1). For the special case a = 1, this dynamic has been studied in 
[1] and it has been shown that in this case z(F[r]{p)) > z{p),p G A°, with equality for fixed points 
only. (In fact this result remains true, if a < 1, see Theorem 2.4.) 

For the second dynamic, similarly to [4] we choose F[©^], where 

©ir := n (1- n Pj‘)- 

R:{i,r)eR (j,s)€/t\(»,r) 

Note that if p is near p G A^ and p violates some constraint R, then for (z,r) G R the fitness 0ir(p) 
is as desired very small, what should contribute to make pir smaller. 
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The combined dynamic F is then defined as 






PirFirCp)" • QivjpY 

TH=iPi,^i.ipY ■ Qi.(py 



In Figure 1 the effects of the three dynamics are shown for the C-SAP with n = k = 2 



(5) 



max zip) = 4pnP2i + bpuP 22 + 2 puP 2 i + SpnP 22 
s.t. P 11 P 21 = 0, P 12 P 22 = 0, p € 

In the first picture we show the trajectories for the gradient- type dynamic used to maximize 

z{p) (without satisfyability constraints). The dynamic F[0] adequate to ”fiow away” from the 
forbidden assignments (without optimization) is represented in the second picture and the trajectories 
of a combined dynamic F[r^0®*®] are shown in the last picture. 




Figure 1: Trajectories of F[F®-^],F[0] and of the combined dynamic F[F^0°-^] 



Note that F[0] cannot be extended to the boundary of A for some denominator may vanish at 
infeasible assignments. For theoretical reasons (see Theorem 2.2) we shall restrict the domain of all 
the operators involving 0 to 

k 

A" := {p e[e,l-{k- = 1 for all i e N}. 

r=l 

If not discussing the current value of e we consider e as fixed. The range of our operator must 
be A^ as well and we define ^ : A A^ as follows: If pir < e then ^(p)*r •'= s, ^(pi^(i)) 
min{pir(i), 1 - (A: - 1)£:}, where Pir(i) > Pis, s < r{i) and p*r(t) > Pis, s > r{i) and finally for the 
remaining components ^(p)»r := e + p(Ptr ~ ^), with p chosen appropriate to fulfill ^(p) e A. We 
denote 

FK]:=^oF[f] (6) 

and define Round : A — ^ A^ by Round{p)ir = 1 iff r = r(i). 

As mentioned in the introduction, one motivation for considering such a heuristic is that if a C-SAP 
has a very big isolated optimum, our heuristic should be able to find it. This is stated in the following 
theorem. 
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Theorem 2.1 Let (7^, T) be a C-SAP instance with p a feasible point in {0, and 6 > e. For 

the extended instance UT') with V := {(i,r-(i))|i G iV,r(i) such that Pir(i) = 1} and weight 

_ 2 ^/** Y^TeT '^T 

^ == ^(n-l)+W/a+l/a’ 

any sequence p®,p^ := F(jP) = ^(F(p°)), . . . with pjj > (5, Vi G iV converges to ^(p). 



Proof: Let p := p^ 
i £ N. Then 



for some j > 0 and p' := F{p). Moreover, assume w.l.o.g. that pn = 1 for all 



Pir = Uir • Ptr 



with 



r<r(p)“ • Qirlpy 
Ej=iPi.r<,(p)“ • Guipr 



We show that u*r < 1/2 for alH G AT, r G K \ {1}. 

Therefore we need that for all i G iV, rir(p) < St€T for all r G AT\{1}; rji(p) > ^ 

and 0ii(p) = riR:(i,i)€fl(l - rio, 5 )ei 2 \(i,i) Pjs) > The last inequality is true, since p is feasible and 
therefore (i, 1) G R implies that there exists e R, s' ^ 1. Hence (1 - U{j,s)eR\{i,i)Pjs) > 

1 “ Pj's' ^ 

Note that for i G iV, r ^ 1 



^ Tiripr^Oiripr ^ {ErerWTr 

Ej=i PisTisip)'^ • Oisipy Tiiipy • e»i(p)^Pii 
(Srer^T)°‘ ^ (Srer^r)° _ 1 
- “ 2(ETer^T)“ 2 



Since < 1/2 for alU G iV,r ^ 1 and ^(p')ir < Pir if Pir > we have pj;^^ < IpJ,. or pj;^^ = e, 

which ensures convergence of p®,p^ := F{p^)j ... to ^(p). 4^ 

Clearly the factor T in our dynamic is responsible for the nice behavior just discussed and we shall 
return later to the specific nature of the dynamic F[T]. 

The above theorem shows that if the objective value of a feasible point p G is high enough, all 
trajectories lead to it. What happens, however, if an infeasible point p has such a high objective 
value? In fact, the repelling part of our operator acts nicely if p has a neighbor (with respect to 

change one value of one variable), for which less clauses in 1Z are violated. 



Theorem 2.2 Let Fir > I for i e N,r € K, /3 > I and p £ be such that there exists a neighbor 
q e A^ of p which satisfies strictly more clauses in % than p. Then for an appropriate choice of e 
there exists a neighborhood U of^{p) such that Round{F(p)) ^ Round{p),p G U. 



Proof: Note first that the condition Fir > 1 is not restrictive, since for z := z E»rPtr we have 
2 := z + n, Vp G A and z satisfies the hypothesis on F. The proof is similar to that of Proposition 
3.1 in [4] and we give here just the main ideas. Let p G A^ be an assignment for which Ui clauses 
are not satisfied and q, obtained from p by changing the value of one decision variable, for which 
U 2 < ui clauses are not satisfied. W.l.o.g. we assume that Pii = 1, Vz and gi 2 = 1. For simplicity we 
assume that all clauses have cardinality two. For p = ^(p) those factors of ©n(p) := riie:(i,i)efl(l “ 
n(j s)eR\{i,i) Pjs) corresponding to clauses satisfied by p are equal to 1 - e and their product can be 
approximated by 1 for our purpose (Actually we chose e: > 0 small enough for our needs). The other 
factors (corresponding to violated clauses) are equal to e and there are u < Ui of them, so that 
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0ii(p) « Similarly, Buip) w and therefore 0i2(p)/Bii(p) « Let Tmax be an 

upper bound for Tir.i e N^r e K, then 

^ ^12P12 ^ ^ _J_^)9(u2-«i)+1 

^(p)ii ~ rs^efiPu - rs^ef, ~ 

By a good choice of 5 > 0 this can be made as large as needed to ensure that F{p)i 2 > F{p)n. 
This can be done a priori for each instance of C-SAP (on the basis of the number of the variables, 
maximal size of the clauses, and upper bound for the ratio of the r*,. all assumed to be greater than 
one). By continuity, the result holds also in some U{p). 4 

Note that e is dependent on the instance. If we want to construct an operator F which satisfies 
Theorem 2.2 then for an instance as in Theorem 2.1 e will be smaller than S. 

These two theorems describe nice properties of our combination of the ’T-part” and the ”0-part”. 
If for example we construct a penalized objective function of the form 

Z(p)=z(p)-K{Y1 n Pir) 

Ren {i,r)eR 

and maximize it by some gradient procedure, we loose these properties, if we want K to be large 
enough for a hill climbing procedure to prefer a feasible assignment with objective value 0 to any non 
feasible neighbor with value c > 0. In this respect a reasonable choice for K would be the sum of the 
coefficients of 2 ;. Consider now the (n = 2, A; = 2) instance maxcpnp 2 i s.t. P 11 P 22 = 0, P 12 P 21 = 0 
and p € A. Our choice gives 



i(p) = CP11P21 - c(piiP 22 +Pl 2 P 2 l)- 

The direction of the resulting gradient has then no dependency on c, as well as the basin of attraction 
of (0, 0) and (1,1). Hence the property of Theorem 2.1 is lost. 

By Theorem 2.2 we can not converge to an infeasible point having better neighbors, from the feasibility 
point of view. How about convergence in general? The following example shows that cycling is 
possible: For the problem 



max zip) = P 12 + P22 + 5000(piiP2i) 
s.t. P 11 P 21 = 0, p € A^ 

consider F of (5) with a = 1 and /3 = 4. Note that {p € A|pn = P21} is invariant under F due to 
the symmetry of the instance and let /(pn) := F(pu, 1 — Pii,P2i = Pii, 1 — P2i)- It can be shown 
that / has a cycle of period 3 (/(0.174) « 0.9979, /(0.9979) « 4 • 10-^ /(4 • 10"^) « 0.174) and 
therefore cycles of any period exists by Sarkovskii’s theorem [5]. 

Nevertheless, in our numerical experiences convergence of the algorithm was observed and a useful 
stopping criterion can be given for testing if for each i e N the biggest component is going to increase 
for ever given that the fitness functions ^ir^i ^ N^r e K are polynomials. We show it for functions 
^ir with positive coefficients (the necessary modifications for negative coefficients is straightforward). 
The stopping criterion works as follows: Given p 6 A° (w.l.o.g. we assume that pn > Pi,., Vr > 1) 
we want to test if the sequence of iterates F‘[^](p),t > 1 converges to p* where pj^ = 1 for all i. 
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Decompose now ior i £ N^r ^ K iir as H- f," where consists of those monomials in for 

which all variables have second index equal to 1. Note that ^»r(p*) = ^iAP*)- Finally let q G 
with Qii = 1 and qij. = 1 — Pir^ Vr > 1. 

Stopping criterion: If alH G iV,r > 1 then STOP. (7) 

Note simply that under the assumption that pn is not decreasing for i € N, ^hip) is and will remain 
a lower bound for fii(F'[^]*(p)), t > 1 and for i G iV, r > 1, CiriP*) will remain an upper 

bound for ^ir(F'[f]‘(p)) for t > 1. Therefore we have for all t > 1, ^ti(F'[^]^(p)) > ?" > 1 

and F"[^]^(p)ii will remain greater than F'[^]^(p)trj r > 1. Moreover, if (F"[^]*(p®), ^ > 1) converges 
to p* G and furthermore ^*i(p*) > ^ir(p*) for all i G iV and r > 1, simple continuity arguments 
show that there exist n > 1 and p = F[f]”(p®) for which this test is successful. 

We discuss now the stability of a fixed point p in the interior of for an operator F[^] with the 
property that d{^ir)/d{pir) = 0 for alH G iV,r G X (which we have in our approach): Since p is 
a fixed point, ^ir(p) = Ni{p) = ErPiAirip)- Moreover, a(F[^]jr)/9(pir) = 1/Nf ■ (^irNi-pir^l) = 
1 /Nf • -Pir)) = 1 — Ptr- It follows that the trace of the differential DF{p) of F[^] at p is n{k — 1), 

viewing F[f] as a mapping with domain Note that A® (actually the range of F[f]) has dimension 
n{k-l) and that DF{p) has by construction n vanishing eigenvalues. So the sum of the eigenvalues is 
at least the number of non vanishing eigenvalues. It follows that unless all non vanishing eigenvalues 
are equal to one, at least one of them will have modulus strictly greater than one, a criterion sufficient 
to imply that the fixed point p is unstable. The following slight modification of F[^] implies this 
desirable property for all fixed points in the interior of A^ [4]. Let Ojr := rii<-<* (1 —Pis) and take now 
as fitness function with 0 < h. The trace of the corresponding differential is then n{k - 1) +nh 
and all fixed points in the interior are unstable. Furthermore, h can be chosen small enough to have 
otherwise a negligible effect and not alter our results in A^. This f2-modification is a theoretical 
artefact and was not implemented. 

To conclude this section, we present some results for the special case of C-SAP without feasibility 
constraints, i.e. 7?. = 0. 

Let Vz be the vector field associating to every p G A° the vector Vz{p) := FiLKp) - p. 

Proposition 2.3 Let A® be equipped with the Riemannian metric corresponding to the scalar product 
given by the nk x nk diagonal matrix Q(p) = (qirjs{p)) {qir,ir{p)) = TfrPir^ir{p)/Pir, then Vz is 
the gradient of z with respect to Q. 

Proof: This metric is a straightforwad extension of the so-called Shahshahani metric (see for instance 
[8]). The proof consists in verifying directly that for every p G A® and for every vector v in the tangent 
space of A° at p (i.e = 0 for all i) we have K(p)'Q(p)t^ = Dz{p){v), where on the right-hand 

side we have the differential Dz{p) oi z dX p applied to v (see [6] for the relevant background). 4 
A pleasant feature of our formulation (1) is that the objective is expressed as a harmonic function 
and therefore has no local optimum in A° if not constant. Recall on the other hand that gradient 
trajectories may not converge to a point. In our case however, it can be shown that if a trajectory 
has an accumulation point which is a local optimum in the interior of a face, then it converges to it. 
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Note furtermore that different functions on have the same restriction to A and induce different 
dynamics. 

The introduction of our exponent a in (5) rests on the following generalization of [1] and can be 
proven similarly. 

Proposition 2.4 Let g : IR+ IR+ be monotone and concave and let Gir{p) := p(rjr(p)) p G 
ieN.reK. Then 

ziF[G](p))>z{p)peA^ 
and 2 (f [G](p)) = z{p) only if F[G](p) = p. 



3. Implementation and empirical results 



We tested our algorithm for two different problem classes. The first class is max-cut. The second 
contains randomly generated C-SAP instances for which finding feasible solutions is neither too easy 
nor too hard. In order to accelerate the evolution of the system, F of (5) is applied in a sequentially 
deterministic block Gauss-Seidel fashion, i.e. for j € N let 

Fi(p)ir ~ I (8) 

I pir otherwise 



Instead of F of (5) we apply F' := Fn o ... o Fi. 



3.1 Empirical results for the max-cut problem 



Given a graph G = (V, E) and non-negative weights Wij for (z, j) € E. The max-cut problem consists 
in finding 



max > 1 

SCV 

i£S,j^S 



This problem can be formulated as a C-SAP with n := \V\ and A; = 2 with: pn = 1 if node i G S 
and Pi 2 = 1 otherwise. The set of weighted clauses T is 



7" := U ({(*i.J 2 )}U{(i 2 ,yi)}) and 71 = 0. 

(ij)eE 

The objective function of the C-SAP is therefore 



^(P)= ^ijiPiiPiz+PjiPiz)- (9) 

(m)€E 

To solve the max-cut problem we use the operator F\ where we choose a = 0.5 and of course j3 = 0. 
Moreover, we use a slightly stronger stopping criterion than (7): for a strict local optimum p* a 
region, so-called guaranteed basin of attraction GBA(p*), can be determined, which is contained in 
the basin of attraction of p*. Our algorithm runs as follows: we take the best solution found from 
10 starting points with components randomly chosen from a given interval J = [0.4995, 0.5005]. For 
each starting point we iterate := F'(p°),... until a generated point, say p^ is either in the 
guaranteed basin of attraction GBA(pl) for some p* € A^ or for all i € AT the biggest component 
m3x{pii,Pi2] > 1 - 10 “^ or the number of iteration k attains a given iteration limit (= 1000 ). 
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We tested the algorithm on 10 randomly generated max-cut instances with n = 50 vertices, edge 
probability of 0.2 and weights Wir uniformly distributed in the interval [1, 100] for i G AT, r € {1, 2}. 
These instances have been solved optimally using an interior point algorithm in a branch and bound 
framework [3] and therefore the optima are known. 

Each of the 10 instances has been solved 20 times (=20 runs). In all runs we were at most 2% off 
the optimum and on the average only 0.25%. Moreover, for all 10 problem instances the optimum 
has been found in the 20 runs (5 times at least for each instance, 11 times in the average). One run 
took about 5 seconds on a Pentium Pro 200 processor. Compared with the case where the GBA 
part of the stopping criteria is omitted the GBA criterion gives a speed up of about 80%. Hence 
this criterion has not only the advantage of a ’clean’ stopping criterion, but also a practical running 
time advantage. 

3.2 Empirical results for a class of C-SAP 

We tested our algorithm on randomly generated instances of C-SAP and compared it with the results 
of a Tabu Search (TS) [7]. This TS is in a working stage and possibly the results there can still 
be improved. The generated instances have n = 100 and A; = 5 and 71 contains lO'OOO randomly 
generated clauses of cardinality 3. Moreover T contains randomly generated clauses with cardinality 
between 1 and 5: 500 clauses of cardinality 1 with weights unifomly distributed in [10,20], 800 
of cardinality 2 with weights in [100,200], 1000 of cardinality 3 with weights in [500,750], 800 of 
cardinality 4 with weights in [750,1500] and 200 of cardinality 5 with weights in [1500,2500]. This 
choice was motivated by the following observations: first, there is no apparant relation between the 
best solutions found for this problem and the ones with 7^ = 0; second, the mixture of the clauses in 
T satisfied by the best found solutions has no apparent pattern. 





DSH 


Tabu 




min 




max 


min 


avg 


max 


PI 


42941 


44704 


47081 


44436 


46725 


49809 


P2 


51498 


53050 


55134 


51932 


53449 


55623 


P3 


52988 


54094 


55417 


51564 


54843 


58549 


P4 


53238 


54326 


56876 


53386 


55526 


59128 


P5 


53422 


54885 


56891 


54968 


57035 


59849 


P6 




52722 


55236 


50840 


54159 


57679 


P7 


51221 


52694 


56100 


48233 


53465 


55897 


P8 




54004 


57186 


50701 


52428 


54219 


P9 


51642 


53446 


55667 


53226 


54729 


58295 


PIO 




52693 


54830 


51371 


54895 


57575 



Table 1: Comparison DSH with Tabu 



One run of the implemented algorithm (DSH) takes the best solution found from 100 starting points 
(this number has been chosen in order to use equal CPU-time as TS from one starting point.) For 
each starting point we essentially iterate := ^(F[r®*®0^](p)), . . . In order not to get stuck 
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at the boundary of we recenter the points every 10 iterations (i.e. add a constant to all variables 
and normalize). Moreover, the encountered solutions were improved by a greedy algorithm (first 
priority: satisfiability, second priority: objective value). 

Table 1 shows the results of our heuristic DSH compared with TS for 10 instances. Each instance was 
solved 10 times (= 10 runs) and the minimum (min), average (avg) and maximum (max) over these 
10 runs are given in the table for the 10 instances. Though DSH is on the average 1.9% worse than 
TS it nevertheless has found 2 better max and 4 better min values. We are confident that there is still 
some room for improvement, in particular the order of the variables in our Gauss-Seidel procedure 
could take advantage of the available information. Moreover, DSH is less sensitive to shortage of 
CPU time. To conclude, DSH is (still) slightly worse, but it is an interesting alternative which could 
be combined with other LS heuristics, due to its completely different approach. 
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Switzerland. 
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On the Mathematical Treatment of Pileproblems 



L. K^merer, Bauhaus-Universitat Weimar 
L. Hempel, Bauhaus-Universitat Weimar 

Summary: Pileproblems are difficult discrete optimization problems. They deal with practical 

tasks like loading and unloading trains trucks or ships. We find problems of this kind also in civil 
engineering and in logistics of course. Even simpler definitions remain such problems hard to solve. 
We consider the following problem: A given pile - consisting of elements Vi of a set F - should be 
moved step by step to a final pile with a possibly different structure. To reach the required final 
situation there are auxiliary places available. Calculating the number of necessary auxiliary piles 
is known to be NP-complete. We report on experiences with a branch-and-bound algorithm and a 
heuristic method. 

Finally we introduce a more general problem that yields when using a mapping between the elements 
of the starting pile and the final pile that is not unique. 



1. Introduction 



A typical example of the pileproblems that we consider is the following: 
starting situation: final situation: 



mm 

3 I 



4 



mm 



m 

m 

im 

I mm 



Step by step the pile from the starting situation is to be transformed to the required structure of the 
final situation on another place. In every step only one element from the top of the starting pile or 
from any auxiliary place may be moved. To determine the minimal number of steps to finish off this 
task, we have to calculate how many elements have to put aside on an auxiliary place. The structure 
of the piles can be modelled as acyclic directed graphs, and each vertex is linked by a directed edge 
to another vertex, if and only if the corresponding element in the pile rests on top of the other. For 
the example above we have the formal pileprohlem Q = (Gs, Gp, k) with 
Gs = ({1,2, 3, 4, 5, 6}, {(1,3), (2,3), (3,4), (4,5), (4,6)}) and 

Gp = ({1,2, 3, 4, 5, 6}, {(4,5), (3,2), (5,2), (2,1)}), where k auxiliary places are available: 

Starting pile Gg: final pile Gp: 1 



1 



3 ^ 



4 ^6 



5 



6 



3 
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Definition 1 A pileproblem Q = (Gs, Gp, k) is called solvable, if and only if there is a sequence of 
movements of elements, such that at most k elements have to put aside when piling up the the final 
pile. 

For every auxiliary place only one element is allowed to lie on. Therefore the number of elements that 
must be put aside in order to solve a pileproblem is the same as the number of necessary auxiliary 
places. The example Q above is solvable for every A; > 2. 

We are now interested in the smallest number k, a pileproblem Q = (Gs, Gp, k) remains solvable for. 
It holds: 

Lemma 1 A pileproblem Q = (Gs,Gp,0) is solvable if and only if Gs U Gp contains no cycles. 

Proof: The decisive factor for the lemma is that the arcs in Gs and Gp are directed in the opposite 
way: In Gs they are directed top to bottom, in Gp they are directed bottom to top. 

(” =>” ) Suppose that Gs U Gp contains a cycle C. We denote that element of C that is put first to 
the final pile by y. Because y is contained in G, there is an x with {x, y) € C. On the other hand 
the directed edge (x, y) is in Gs, otherwise x has to be put to the final pile before y. If we now want 
to access y in order to move it to the final pile, we first have to put x to an auxiliary place. This 
contradicts our assumption. 

(” <=” ) Suppose Q is not solvable. Hence every sequence of moves of elements leads to a situation Si 
in which no element from the top of the starting pile can be moved to the final pile nor to any of the 
auxiliary piles. 

The remaining pile on the starting place in situation Si is denoted by G^. Now W is the set of 
elements resting on top of G^. U is the set of vertices of G^ that are not in W. 

Because none of the vertices of W can be put to the final place it holds: For every w ^ W there 
exists an u € f/, such that there is a path leading from u to it; in Gp. This means: Before any w 
from W can be put to the final pile, & u S U has to be put there. On the other hand we know by 
definition of U : For every u e U there is a u; e W, such that there is a path leading from it; to u in 
G'. 

To each vertex of U there is a path from a vertex in W in Gg U Gp. In the same way to every vertex 
of W there is a path from a vertex of U in Gs ^Gp. Because U and W are finite sets, there is a cycle 
in Gs and therefore also in Gs U Gp. This is a contradiction, our assumption is wrong. □ 

It follows immediately that cycles in Gs U Gp are responsible for putting elements to auxiliary piles 
when solving pileproblems. We obtain 

Lemma 2 Let Q = (Gs,Gp,A:) be a solvable pileproblem. Then it holds: For every cycle C in 
Gs U Gp there is a vertex xc with {xc^yc) ^ G and (xc,yc) ^ ^(^s); which must be put to an 
auxiliary place in order to solve Q. 

Proof: We consider a cycle G = [xi, X 2 , . . . , x\] in Gs UGp. Let x* be the vertex in G, that is put 

first of all vertices Xi, . . . ,x«, to the final pile. Therefore the arc (xj_i,Xi) can’t be in Gp, because 
Xi has to be put to the final pile before Xj_i. That’s why (xj_i,Xj) € >l(Gs) holds. That means: 
Xj_i must be taken away from the starting pile before Xj, but may be put to the final pile after Xj. 




194 



This can be realized only if Xi-i is put to an auxiliary place. The arc is the arc {xc,yc) 

mentioned in the lemma. □ 

For cycles C in Gs U Gp with a special structure we can determine that element of C which must be 
put to an auxiliary place with the following lemma: 

Lemma 3 SCHMlEDEL/i2, 1] Let Q = be a solvable pileproblem and 

C = [ai, . . . , Ox, 6i, . . . , 6y, ai] a cycle in Gs U Gp, such that Pi = [oi, . . . , Ox, 6i] is contained in Gs 
and P 2 = [6i, . . . , by, Oi] in Gp. Then Oi must be moved to an auxiliary place to solve Q. 

Proof: With lemma 2 it follows that there is an element Oj € {oi, . . . , Ox}, that must be moved to 
an auxiliary place to solve Q. Since P( = [oi, . . . , Ox] is contained in Gs this vertex must be Oi. □ 

2. Algorithmic Complexity 

Calculating the minimal number of auxiliary piles necessary to solve a pileproblem turns out to be 
difficult: We define SP := { Q = (Gs, Gp, k); Q is a solvable pileproblem}. 

Theorem 1 Schmiedel/S, 1] Deciding whether a given pileproblem Q is in SP is NP-complete. 

We can give the following boundaries for the number of necessary auxiliary piles: 

Theorem 2 Let Q = (Gs,Gp,A;) be a solvable pileproblem, where k is the smallest number, Q 
remains solvable for. c is the number of cycles in GsUGp, dc the number of disjoint cycles in GsUGp 
and msec the number of strongly connected components in Gs U Gp. It holds: c> k > dc> msec. 

Proof; There may be more than one disjoint cycle in one strongly connected component, but there 
is no cycle lying in more than one strongly connected component. Therefore dc > msec holds. 
Assume k < dc. By lemma 2 at least dc vertices must be put aside in order to solve Q. Hence k> dc 
holds. 

Suppose that c < k. By putting c vertices aside, Q can be solved. On the other hand k is the smallest 
number of necessary auxiliary places Q remains solvable for. Therefore we have c> k. D 
Now there are two ways of algorithmic treatment of the problem. The first way is to solve it by 
complete enumeration, which surely takes a long time to compute, due to the NP-completeness of 
the problem. The second way is to obtain an approximate solution by a polynomial time procedure. 

3. Branching Bounding 

To compute the minimal number of necessary auxiliary places we use a simple technique: Step by 
step we simulate the moving of the elements from the starting pile to the final pile place. If there 
is no element on top of the starting pile that can be moved directly to the final pile we need an 
auxiliary place. 

Next we check if there is a cycle containing a vertex v that also rests on top of the starting pile and 
which satisfies lemma 3. In this case we move v to an auxiliary place. After this we may have access 
to new vertices on top of the starting pile. Now we can search for vertices on top of the starting pile 
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which can either be moved directly to the final pile or which satisfy lemma 3. We continue these 
steps as long as there are such vertices. 

But once we reach a point where we don’t find such vertices - we have to move an element from 
the starting pile to an auxiliary place. The problem is now that we do not know the right vertex to 
choose, that’s why we have to make the whole computation for every vertex on top of the starting 
pile: We move one vertex to an auxiliary pile and solve the reduced problem recursively. This 
backtracking-procedure can be shortened to a simple branch-and-bound algorithm. As a result we 
obtain the smallest number of auxiliary piles that are necessary to solve the problem Q. 

4. Heuristics 

The heuristic procedure we considered works similar to the branch-and-bound algorithm. At the 
point where there is neither a vertex to move directly to the final pile nor a vertex satisfying lemma 
3, we have to choose only one element to move from the starting pile to an auxiliary place. 

This decision is not corrigible, that’s why our choice should be depending on a well defined criterion. 
For that reason we assign a value to all possible elements (the elements on top of the starting pile) by 
a function / and choose a vertex v for which the value f{x) becomes a maximum. The best results 
were obtained by the following function /: 

Definition 2 Let Q = (Gs^G^jk) be a pileprohlem and vi,...jVt the elements on top of Gs- Ei 
denotes the set of successors ofv{ in Gs, which have no predecessor except Vi. We define the following 
function f for i = l,...,t; 

f{vi):=\{veEi: v can be put directly to the final pile or satisfies lemma ,?.}| 

Hence f{vi) is the number of vertices that either satisfy 3 or can directly be moved to the final pile 
right after Vi is taken away from the starting pile. 

5. Experiences and Results 

Before starting some experiments a number of example piles are to generate. We varied the density 
of arcs in the piles and the graphs had at most 100 vertices. 

When making the first experiments it happened that a lot of pileproblems were solvable in reasonable 
time. This means: In the branch-and-bound algorithm there is no branching necessity - in the 
heuristic algorithm none of the vertices must be chose by /. Step by step we can find vertices which 
satisfy lemma 3 or vertices that can be moved directly to the final pile until the whole problem 
is done. Of course, the results by the branch-and-bound and by the heuristic algorithm are the 
same. There were 87% of the examples solvable in reasonable time, at a density of 25% of arcs. All 
statements about the heuristic algorithm in the following are with regard to those examples that 
cannot be solved in reasonable time, i.e. branching is always necessary. 

5.1 Quality of the Solutions 

We started a series of experiments to measure the quality of the heuristic algorithm and considered 
a density of arcs of 5,10,15,20 and 25 per cent. In table 1 we give the percentage for the optimal 
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solution (O) by the heuristic algorithm as well as for “good” solutions (G) (good solutions differ at 
most one element from the minimum). 



Quality of the 
heuristic algorithm 


Den 

5% 


B 


arcs: 

15% 




25% 


Percentage of exact solutions 


84 


76 


80 


82 


86 


Percentage of good solutions 


97 


95 


97 


97 


98 



Table 1: Percentage of optimal and good solutions provided by the heuristic algorithm for different 
density of arcs 



5.2 Computing Times: Branch-and-Bound vs. Heuristics 

The computing times for the branch-and-bound and the heuristic algorithm are odd: The considered 
examples can be divided into two sets. The first and larger one contains piles for which the computing 
time is almost the same for both algorithms. The computing time TneunsticCQ) for the heuristic 
algorithm is less than one second for these examples. (All given computing times are with regard to 
a Intel Pentium 133 with 32 MByte main memory.) The computing time TBranch-and-Bound (Q) of the 
exact method is also less than one second. 

In the second set we find those piles for which the heuristic algorithm is much faster than the exact 
method, we find computing times from 30 seconds up to several hours, but the heuristic algorithm 
takes only a few seconds for the same examples. For bigger sized problems (n > 200) it’s difiScult 
to decide whether the above statement is true or not, the heuristic algorithm also works very fast, 
but it’s almost impossible to prove the quality of the results because it’s impossible to calculate the 
exact values. In table 2 we show the computing times of the branch-and-bound algorithm depending 
on the structure of the piles. We considered the same examples as in table 1. 

Finally, the combination of both algorithms seems to be a good procedure: First is to check if the 
branch-and-bound algorithm returns a result within a defined time interval and only if it doesn’t 
stop, the heuristic algorithm should be used to solve the problem. 

6. Non Unique Pileproblems 

An obvious generalization of the pileproblems considered so far is a non unique assignment of the 
elements of the starting pile to the elements of the final pile. This yields to the term of non unique 
pileproblems. 

Definition 3 R = {Gs = (Vs,^s), Gp = (Ff,Af), Gm = {Vs,VF,M),k) is called non unique 
pileproblem, if and only if 

i) Gs and Gp represent starting and final pile, respectively 

ii) A: > 0 is a natural number. 
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Computing time 


Densi 

5% 


ty of £U 
10% 


■cs: 

15% 


20% 
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<ls 


87 


74 


91 


82 


59 


1 -5s 


9 


17 


6.7 


15 


36 


6 - 120s 


3 


5 


1.7 


2.1 


4 


121 - 7200s 


0.8 


3 


0.4 


0.7 


0.6 


> 7200s 


0.2 


1 


0.2 


0.2 


0.4 


number of investigated examples 


1240 


2785 


3043 


2242 


1623 



Table 2: Computing times of the branch-and-bound algorithm depending on the density of arcs 



iii) Vs and Vp are disjoint and it holds [Vsl = |Vf| and 

iv) the undirected graph Cm is bipartite and assigns elements of the starting pile to elements in 
the final pile. 

To transform the starting pile Gs of such a non unique pileproblem to the required final pile Gp it’s 
obviously necessary to find a one-to-one correspondence between the elements of the starting pile 
and the elements of the final pile. Such a correspondence is equivalent to a perfect matching E in 
Gm- Such a perfect matching yields to a unique pileproblem Q[E) as investigated in section l.to 5. 

Q(E) := (Gs, G;,, k) with G'^ := (Vs, A'p) and 

Ap := {(u,u) for all € Ap with {u,u'}, {v,v'} € E}. 

Again, we are now interested in the smallest number k of auxiliary places that are necessary to trans- 
form the starting pile of a non unique pileproblem into the final pile with regard to the assignments 
given by Gm' 

Definition 4 A non unique pileproblem R = (Gs,Gp,Gmj^) is called solvable if and only if there 
is a perfect matching E in Gm such that the resulting unique pileproblem Q[E) = (Gs,Gp,A:) is 
solvable. 

For that reason we like to know a perfect matching E in Gm for which the resulting unique pileproblem 
is solvable with the minimal number of auxiliary piles. Of course it is possible to calculate all the 
perfect matchings in Gm and to solve all the concluding unique pileproblems. But in general there 
are exponentially many different perfect matchings in Gm, therefore determining E in this way is a 
difficult task. Calculating E is intractable at all: 

Theorem 3 Calculating a perfect matching E of Gm for which the resulting unique pileproblem is 
solvable with a minimal number of auxiliary places is NP-hard. 



A proof of Theorem 3 is given in [!].□ 
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6.1 Strategies of Solving 

With theorem 3 it seems to be useful to determine the smallest number of necessary auxiliary piles 
approximately. There are the following two ways: 

• determining a perfect matching E oi Gm and calculating the necessary number of auxiliary 
piles of the resulting unique pileproblem Q(E)^ 

• calculating a perfect matching and a corresponding sequence of moves of the elements simul- 
taneously. 

In this paper we only want to investigate the first way. It’s obviously a good idea to value each of 
the assigning edges in Gm and to calculate a minimum weighted perfect matching E for Gm after 
that. 

There are several criteria to value these edges, for instance: 

1. value with regard to the components, the edge is contained in, 

2. value with regard to the depth of the vertices the edge is incident with. 



6.1.1 Components 

Now we introduce the term component, which is similar to the well known strongly connected com- 
ponent. We consider the graph which yields by taking the union of starting pile and final pile 
and additionally the edges of Gm which now should be directed in either direction: 

Definition 5 G^ := (Vs U Vp, As U Ap U M), with M := {(u, v), (v, u) for all {u, v} E M}. 

Definition 6 We call a subgraph G of G^ component ofG^ if and only if every pair of vertices in G 
is contained in a cycle C = [v \, . . . , Vjt, Vk^i = vi] ofG, such that for alii £ {1, • . . , A:— 1} (vi+i, ^^14-2) ^ 
M follows from G M, that means in C there are no two consecutive arcs from M. 

Now Gj^ can be divided in such components and we can value the edges in Gm' 

Every edge in M the corresponding arc in G^^ lies inside a component will be valued with 1, for arcs 
that link two different components in G^ we value the corresponding edge in M with 0. 



6.1.2 Difference of Depths 

Gs and Gp represent starting pile and final pile, respectively, therefore they are cycle-free. Hence for 
every vertex v of those graphs we can assign a depth t{v) in either graph: 



Definition 7 Let G be a cycle-free, directed graph. For every vertex v e V{G) the depth t{v) is 
defined as follows: 

^ I 0 ; */ V has no predecessor, 

I max{t{v'),v' predecessor of v } 1 ; else. 



Every edge {vi,V2} in M links vertices of Vs with vertices of Vp, therefore these edges can be valued 
with the difference of the depths of vi and V2’. |t(vi) - t(u2)|. 
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6.2 Results 

In our last section we want to report about results obtained by implementing the methods discussed 
in 6.1 

To compare both methods and to get a feeling of the quality of the results we proceeded like in section 

5.2 Starting pile and final pile as well as a graph Gm ~ hence a complete non unique pileproblem R - 
were generated randomly. Afterwards we calculated all possible perfect matchings of Gm and every 
resulting unique pileproblem was solved exactly. At the same time R was solved by the algorithms 
described in 6.1.1 and 6.1.2 We investigated 1000 examples, each of which had 20 elements and a 
density of arcs of 25 per cent. In table 3 we show the number of exact solutions, solutions that differ 
about one, two and more than four vertices. Due to the small size of the considered examples (20 





number 
of exact 
solutions 


results dif- 
fering about 
one vertex 


results dif- 
fering about 
two vertices 


results differing 
about more 
than four vertices 


Value by components 


140 


285 


273 


61 


Value by depth 


190 


351 


248 


24 



Table 3: Quality of both heuristics in comparison for 1000 examples 



elements), the computing times of the heuristic algorithm are not worth mentioning. 
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Mean Value Cross Decomposition Based Branch-and-Bound 
for Mixed Integer Programming Problems 



K. Holmberg, Linkoping University 

Summary: This paper deals with the usage of mean value cross decomposition in a branch-and- 

bound method for mixed integer programming problems. Special issues discussed are heuristics 
for obtaining feasible integral solutions, information available to base the branching decisions and 
tree search strategies on and reoptimization after branching or adding valid inequalities, as well 
as the availability of reduced costs, to be used for variable fixation tests and branching decisions. 
Finally application to problems of a typical structure, facility location, is discussed, and preliminary 
computational results are presented. 

Key Words: Mathematical programming, Mixed Integer programming, decomposition methods. 



1. Mean value cross decomposition 

Mean value cross decomposition, [8], [ 9 ], is a decomposition method developed for LP-problems. In 
[12], it is shown that applied to (mixed) integer problems, the method is able to yield the same 
bound as the Lagrangean dual, which can be better than that of the LP-relaxation. In this paper, 
we proceed in this direction by actually developing an exact method for mixed integer programming 
problems, based on branch- and-bound and mean value cross decomposition. 

Mean value cross decomposition has relations to other approaches, especially Lagrangean heuristics, 
see for example [ 7 ] and [2]. Advantages of the mean value cross decomposition approach are better 
primal controllability and in general more information available, both dual and primal. 

We study the following mixed integer programming problem. 

V* = min (Fx + (Fy 

s.t. Aix -f Biy < bi, X e X, y eY 

where X = {x : A2X < 62, a; > 0 }, V = (y e Vlp : y integer} and Ylp = {y : B^y < bs, y > 0 }. We 
assume that X and Ylp are bounded, and that for each y £ Ylp there exists some x e X such that 
AiX-\-Biy < bi. Let us by “F-convex” denote the special case when Yc = Ylp, where Yc = conv(Y). 
In the Y-convex case, all extreme points of Ylp belong to Y. Now define 
L{x, y, u) = (Fx + (Fy + F{A\X + Biy - 61) 
and, for any fixed y £ Ylp and u > 0, 

K(y,u) = minL{x,y,u) s.t. x € A [PDS] 

PDS is an LP-problem with a bounded, feasible solution for all y £ Ylp and > 0 . 

We now define the primal and dual subproblems, PS and DS. 

'ip{y) = maxK{y,u) s.t. u > 0 
ip{u) = min K{y,u) s.t. y £Y 
Note that PS and DS can also be expressed as follows. 

'ip{y) = min c'^x + Fy 

s.t. A\x <bi — Biy, X £ X 



[PS] 

[DS] 

[PS2] 
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[DS2] 



(p{u) = min c^x + (Fy + fF{AiX + Biy — bi) 
s.t. X e X, y eY 

We can also note that DS2 separates into one LP-problem in x and one IP-problem in y, and that 
PS2 is an LP-problem. 



Lemma 1 'ip{y) is convex and piecewise linear, (p{u) is concave and piecewise linear. 



The following “master” problems can be used to find the best values of y and u. 

V* = mm'tp{y) s.t. y eY [PM] 

vl = max(p{u) s.t. u > 0 [DM] 

DM is simply the Lagrangean dual of P, and we have the following well known results. 

Lemma 2 (f{u) < vi < v* > 0. '0(y) > u* Vy € Y. 



The LP-relaxation of P, PL, has objective function value vlp, and is obtained by replacing Y with 
Ylp The LP-relaxation of DS, DSL, has objective function value and is also obtained by 

replacing Y with Ylp- Let us now define some convexified master problems. 



vlp = min^(y) s.t. y G Ylp 


[PML] 


vlp = max(pLp{u) s.t. u > 0 


[DML] 


Vl = minV^(y) s.t. y eYc 


[PMC] 



PML and DML are simply the LP-relaxed versions of PM and DM. The fact that the objective 
function value of PMC is vl is proved in [7]. 

Since Yc is not explicitly given (except in the Y-convex case), PMC is an implicit problem, 

also can be stated as 

Vl = min (Fx -f (Fy 

s.t. A\X + B\y ^ 6i, X e X, y e Yc 

Definition 1 DS is said to have the integrality property, [7], if (p{u) = (Plp{u) Vu > 0, 
replacing Y with Ylp in DS has no effect on the objective function value. (This is the case 
Y -convex case). 



which 

[PC] 

i.e. if 
in the 



Lemma 3 If DS has the integrality property, then vl = vlp- 

Lemma 4 All y -solutions of DS (DSL) belongs to the polytope Yc (Ylp). The u-solutions of PS can 
be restricted to a polytope, Uc- 



A mean value cross decomposition method is built on a suitable combination of a primal master 
problem and a dual master problem, which both has the same optimal objective function value. An 
inherent difficulty lies in the usage of mean values, i.e. convex combinations, which prohibits the use 
of PM and guaranteed convergence towards v*. 

We can identify two possible combinations of primal and dual master problems. 

I. DM and PMC, both yielding vl. 

II. DML and PML, both yielding vlp- 
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In the mean value cross decomposition algorithm, [8], [9], one iterates between a dual subproblem, 
DS or DSL, and a primal one, PS. As input to PS, the mean value of the previous solutions of DS 
is used, and vice versa. Let, in iteration k, be the input to DS and its output, and the 
input to PS and its output. Below we describe the mean value cross decomposition method in an 
algorithmic form. 

Algorithm MVC: 

1. Get > 0. Let A; = 1, u = —oo and v — oo, 

2. Let = \v!^ -f ^ 

k k 

Solve DS: y^ = argniini("(i/,u*). 

Let = ^(u^) = K{y^, u^). If > u, let y — </?*. 

3. If u - u < £:, stop. 

4. Let y'° = ij/* + 

Solve PS: u* = Qigm^K{y^ju). 

Let = K{y^^ u^). If < u, let v = 

5. If u — u < e:, stop. 

Otherwise let A; = A; + 1 and go to 1. 

Lemma b eUc VA:. y^ € Yc VA;. 

An alternate algorithm, where y^ 6 Ylp, is obtained by the following modification. 

Algorithm MVC-LP: In step 1 of MVC: Replace DS by DSL. 

Theorem 1 MVC-LP solves (asymptotically) the problem PL. 

Proof: Use [9] with PL as the original problem. □ 

Theorem 2 MVC solves (asymptotically) the problem PC. 

Proof: Use [9] with PC as the original problem. □ 

While MVC-LP solves PL, MVC solves the stronger problem PC. The primal convergence towards 
the optimal solution constitutes a better primal controllability than in Lagrangean relaxation and 
subgradient optimization. So MVC has advantages both compared to solving the LP-relaxation and 
to solving the Lagrangean dual with subgradient optimization. 

The mean value cross decomposition method can be considered as a generalization of the Kornai- 
Liptak method, [14]. In the separable case, mean value cross decomposition is in [8] shown to 
correspond to a two-person, zero-sum, finite matrix game., where the players find the best counter 
strategy based on the mean values of all the previously used strategies of the opposite player. This 
is the Brown-Robinson method for such games, [3], [16], also known by the name Fictitious Play. 
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1.1 Heuristic variations 



Algorithm IMVC: (Interrupted mean value cross decomposition) 

In step 4 of MVC: Every Kth iteration, let = y^. 

In IMVC, the solution of DS is used instead of the mean value as input to PS regularly. This makes 
y^ e Y every Kth iteration, and towards the end of the procedure it might possibly yield y*, the 
optimal solution, before y^ gets close to it. 

One might also use weights different than 1/A; in the convex combination of previous solutions. Let 
/if be the weight of y^ at iteration k. The convex combination in iteration k must be calculated 
with the help of the convex combination in iteration A; ~ 1 and the new solution. Assuming that 
and that is given, we get y* = Ef=i = mIj/* + T,'i=i = lAv'‘ + 

Clearly we must have 0 < CTjfe < 1, and in order to obtain new interesting cases, we should have 

0 < (Jjfc < 1. Also we must have /if + crjfc = 1 and /ij = 1. 

Algorithm WMVC: (Weighted mean value cross decomposition) 

In step 2 of MVC: Let = /ifu*^ + CkU^~^. 

In step 4 of MVC: Let y^ = fi^y^ + (Jky^~^- 

An interesting possibility is 






/J + 7 



and 



(Jk ■■ 



pk-P 



Pk-\-j /?A;H-7 

where we assume that p > 0 and 7 > 0. (Letting 7 = 0 would yield /if = 1/A; and ak = {k - l)/k, 
which is the original MVC.) 



2. The branch-and-bound framework 

In order to exactly solve linear integer programming problems, mean value cross decomposition can 
be used in each node of a branch-and-bound tree, in order to supply lower bounds and branching 
information, and also hopefully feasible integer solutions. If all the y-variables are fixed, PS will 
immediately yield the rest of the solution. So theoretically this method is guaranteed to produce the 
optimal solution. 

Feasible, integer solutions must be generated. Branching must be done in an efficient way. After 
branching, the MVC procedure must be continued. An efficient reoptimization procedure is then 
important. Addition of valid inequalities or facets for the convex hull of the feasible set of P could 
be used to improve the lower bounds in the tree (“branch-and-cut”). In many cases it is possible 
to device fixation tests that fix single variables during the procedure, thus decreasing the size of the 
problem. In order to construct such tests, reduced costs are needed. 

2.1 Reduced costs and fixation tests 

A fixation test can be used to fix the value of a discrete variable, and thus to decrease the size of the 
problem and the tree. To be able to construct dual fixation tests, one needs a reduced cost, di, for 
each variable, obtained from DS. Such a reduced cost determines the optimal value of a variable but 
also yields the consequences of fixing the variable to a non-optimal value, which can be compared to 
the present bounds. 
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Reduced costs for single variables are always available for LP-problems and completely separable 
problems. However, for non-separable non-convex problems, only approximate reduced costs might 
be available. 

The fixation tests are of the following type, where v is the best known upper bound. 

* If dj > 0 and (p{u) -\-di>v then fix yi = 0. 

* If di < 0 and ip{u) — di> v then fix yi = 1. 

These tests can be used in every node of the branch-and-bound tree, and possibly even in each 
iteration of MVC, since they are computationally cheap. 



2.2 Heuristics for obtaining integer solutions 

When using mean value cross decomposition within a branch-and-bound framework, a difficulty is 
that MVC has asymptotic convergence. Even if the primal solutions are approaching the integer 
optimum, we might never get an integer point exactly (unless all variables are fixed). So we need 
heuristics that helps find feasible integer solutions. 

The primal feasible solutions are in MVC obtained from PS (see formulation PS2), but the input 
to PS, y^, is a convex combination of a number of points in V, and quite unlikely to be integral. 
Basically there are three heuristic ways of obtaining an integer ^/-solution, ^ in F, preferably close 
to y^. 

I. Round y^ to the nearest feasible integer. (Based on primal convergence.) 

II. Use y^ (the solution of DS). (Based on dual convergence.) 

III. Combine I and II. (Based on both primal and dual convergence.) 

Case III is in principle to “round y^ in the direction of y*” . The simplest way is just to employ 
“manual” rounding in the following way. For each i: 

* If yf > [yf], then round yf up, i.e. set y^ = [yf]. 

* If yf < [yf J , then round yf down, i.e. set y^ = [y*J . 



2.3 Branching 

In an LP-based branch-and-bound method, it is usual to base branching decisions on the primal 
values of the variables. In a Lagrangean relaxation based branch-and-bound method, it is possible to 
base the branching decision on dual information, if reduced costs (in a general sense) are available. 
Keeping account of the historical values of the primal variables in the solutions of DS can produce 
information which can be used in the same way as the reduced costs. 

In MVC, we have access to all these kinds of information. We have both primal solutions, y*^, and 
dual solutions, yielding (exact or approximate) reduced costs, and primal solutions of the dual 
subproblem, y^. Therefore MVC has a potential to employ more sophisticated branching rules. We 
propose to use all the three criteria together, by using a weighted combination of the different values 
available. 
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2.4 Search strategy 

The branch-and-bound method discussed in this paper will be based on depth-first search. The basic 
search strategies are 

* Start with the most /least promising branch. 

* Start with the upper (>)/lower (<) branch. 

The first search strategies are based on previous solutions in the procedure. How promising a branch 
is can be indicated by either the reduced cost of the branching variable or the primal value of the 
branching variable. These items of information can be obtained from either the last iteration or 
accumulated from several iterations. 

2.5 Reoptimization 

Another important aspect when using MVC within a branch-and-bound method, or if MVC is com- 
bined with generation of valid inequalities or facets of the convex hull of the feasible set of P, is how 
to reoptimize when new constraints are added. 

Let us first discuss keeping the new constraints^ for the case when only the y- variables are included in 
the new constraints. We have approximate primal and dual solutions y and u, and a new constraint, 
oFy < ao, is added to Ylp- Since the constraint does not directly affect u, we can proceed without 
changing it. However, we must assume that a^y > oq. 

Let H = {y:o!Fy< ao}. Then Y = D H and Ylp = fl H, while Yc C Y§^^ n H. We 
know that y € Y§^^ \ Yc, so y must be projected into Yc, which however is an implicit problem. 

In the y-convex case, we can replace Yc by Ylp and solve an explicit projection problem. In general, 
however, this is not possible, since that would make the upper bound invalid (an upper bound on 
vlp, not on vl)- 

A practical possibility is to first do an iteration of MVC with y (obtained from projecting on H), 
i.e. solve PS with y, but disregard the objective function value (it is not a valid bound). This yields 
and consequently u^. DS (with the new constraint included) is then solved with which yields 
y^ e Y. Then we calculate y^ = {l/k)y^ -\-{{k — l)/k)y. If we can verify that y^ € Yc (which however 
in general is quite difficult), we can set y^ = y^ and proceed as usual. Otherwise we need to move 
closer to y^. 

Let y^ = ay^ + (1 ~ <^)^- Since y^ G Y, we know that a = 1 yields y^ £ Y C Yc, while a < l/k 
does not. One could possibly find some sufficiently large a in that interval, and proceed as usual. 
An alternative (that always works) is simply to let a = 1, which coincides with the IMVC technique 
used for obtaining integer solutions. 

Let us now consider the alternative of relaxing the new constraint. In this case we can without 
difficulty handle constraints including both x and ^/-variables. This is more interesting in the case of 
generation of valid inequalities, in a cutting plane approach. The new constraint is simply relaxed 
and put into L. 

In this case the change is in the dual space, so y^ does not need to be changed. Also is a feasible 
starting point (with the new multiplier equal to zero). However, the new constraint is very likely to 
be active, so one can for example start with a subgradient step for this multiplier. Then we proceed 
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as usual, with the new constraint included in PS. 

2.6 Valid inequalities 

Valid inequalities or facets for the convex hull of the whole feasible set of P can be used to strengthen 
both the LP-relaxation and the Lagrangean relaxation, i.e. to increase vl and Vip, Note that it is 
not necessary to generate valid inequalities for Yc, since MVC automatically always yields points 
within Yq. 

The type of valid inequalities that can be generated of course depends on the structure of the 
constraints, especially Y. Structures that can be used can be found in for example [15]. Let us here 
just mention flow cover inequalities based on flxed costs. 

Here it is important to note that the primal controllability, i.e. the convergence towards a primal 
solution of PC, enables the usage of the same type of separation problems as used in LP-based cutting 
plane methods. This is a clear advantage, since a lot of research has been put into the construction 
of separation procedures based on non-integral solutions. In this sense, this approach is better than 
a Lagrangean heuristic. 

2.7 Starting 

MVC is a mean value based method, and it is very important that the starting solutions are good. 
Given near-optimal starting solutions, MVC can often quickly verify the quality of the solutions, and 
terminate the method, see [10] and [13]. This is unlike subgradient optimization, which is very likely 
to immediately jump away from a good starting solution. 

Therefore it is of great interest to employ some starting heuristics, yielding good solutions, before the 
mean values are started to be calculated. Such heuristics can for example be based on subgradient 
optimization. Often it can be useful to use gradually changing weights, to “phase in” (or out) such 
techniques, see for example [11]. 

2.8 The branch-and-bound algorithm 

Below we give the complete algorithm for the branch-and-bound method, denoted by MVCMIP. N 
is the maximal number of iterations in MVC, and e the required accuracy for the MVC procedure. 

1. Let = oo, p = 0. Use a simple heuristic to initiate u, u, y and u. 

2. Let = u, and A: = 1. 

3. Let -f Solve DS: = arginmi^(?/,#). 

Let (p^ = = K{y^, u^). If > v, let v = (p^. 

4. Perform flxation tests. If all variables are flxed, solve PS and go to step 10. 

5. If — u < 1, go to step 10. 

If u - u < £, go to step 8. 
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1 A: — 1 

6. Let = -y^ H — y^~^ for all unfixed variables. Solve PS: = argmaxi<r(y^, it). Let 

K K u^O 

= 'ip{y^) — K{y^jU^). If 'tp^ < 0, let v = xp^. If y^ is integral and xp^ < v then let v = xp^. 

7. If t) — i; < 1, go to step 10. 

If V - ?; < e, or A: > iV, go to step 8. 

Otherwise let A: = A: + 1 and go to step 3. 

8. Find one or more valid inequalities, violated by y^, add them to the problem, and go to step 
2. If no violated valid inequalities can be found, go to step 9. 

9. Find branching variable. Let p = p -f 1. Choose branch to investigate. Fix branching variable 
yj and “project” y to y eYc- If all variables are fixed, solve PS and go to step 10. Otherwise 
set i; = oo and go to step 2. 

10. Let p = p— 1. Ifp<0, then stop. Otherwise backtrack to the first uninvestigated node in the 
tree. Remove all fixations not caused by branching. Use a simple heuristic to initiate y and v. 
Go to step 2. 

3. Applications 

One type of problem with structure suitable for this approach is the well-known facility location 
problem, both in the uncapacitated, ULP, and capacitated, CLP, version. In the literature, one can 
find several applications of Lagrangean heuristics to facility location problems, see for example [6] 
for the capacitated case and [4] for the uncapacitated, while [2] deals with both the cases. Also cross 
decomposition has been shown to yield an efficient solution method for CLP, [17]. Therefore most 
of the details needed in an application of MVCMIP are already known. 

The details of how ULP can be solved with mean value cross decomposition can be found in [11]. We 
note that we have a F-convex case, so DS has the integrality property. Both DS and PS are trivially 
solvable. DS is separable, so reduced costs are available for fixation tests and branching decisions. 
The projection on Yc can be made explicitly, and the primal rounding heuristic is straightforward. 
For the capacitated case, CLP, PS is a capacitated transportation problem with non-integral ca- 
pacities. Here we do not have a F-convex case, and DS reduces to a binary knapsack problem. 
Approximate reduced costs can be obtained by solving a continuous knapsack problem, and used 
for fixation tests etc. Projection after branching is not explicit as in the previous case, but more 
difficult. The primal rounding heuristic works exactly the same way as in the uncapacitated case. 
When it comes to valid inequalities, we can use the procedures proposed in [1] for identifying violated 
flow cover, effective capacity, single depot and combinatorial inequalities. Note however, that the 
knapsack cover inequalities, found in [1] to be the single most effective class, are already implicitly 
included in the problem (in Yq). 

The method for ULP has been preliminarily implemented and tested on 58 test problems of various 
sizes, from m = 10 and n = 25 to m = 100 and n = 1000. IMVC and WMVC seem to be 
somewhat more promising than MVC, and we choose WMVC as best method, with //j^ = 2/(A: + 1) 
and ak = {k - 1) / {k 1). 
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Regarding MVCMIP as a general method for structured MIP-problems, we have compared it to the 
the commercial and quite efficient code CPLEX, but also to a specialized method, the classical dual 
ascent method, DUALOC, [5]. The largest problems, m = 100, n = 1000, could not be solved 
by CPLEX. The average solution times for the problems solved by all methods are 1.18 seconds 
for CPLEX, 0.13 seconds for DUALOC and 0.15 seconds for MVCMIP. For the largest problems, 
m = 100, n = 1000, the average solution time for MVCMIP is 38.5 seconds and for DUALOC 17.4 
seconds. Since our goal is not to find the best method for ULP, but only to show that MVCMIP is a 
promising approach, we are quite satisfied. The implementation of the branch-and-bound procedure 
can certainly be improved. Our main goal here is to give an example of a structure that can be 
exploited in MVCMIP. 
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Building Reusable Software Components 
for Heuristic Search 
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David L. Woodruff, University of California at Davis 

Summary: Today, advanced heuristic approaches for discrete optimization are usually implemented 
from scratch. We discuss some crucial problems that have so far prevented us from having a “heuristics 
stockroom” with ready-to-use software components available. Then, we analyze the domain to be 
covered by reusable software and introduce and discuss a design for a framework for heuristic search. 
Finally, we summarize our experiences and discuss future research topics. 

1. Introduction 

In practice there is a great variety of discrete optimization problems. Since most of them are NV- 
hard, the modeling of the problem is an approximate one, and the data are generally imprecise, 
heuristics are the primary way to tackle these problems. The use of heuristics generally meets the 
needs of decision makers to efficiently generate “satisfactory” solutions. However, though in practice 
there is a huge number of problems to be solved, the following statement of Wolsey [24] may still 
be regarded as valid: “This profession [Operations Research] has presently more algorithms than 
applications [...]” . Compare, e.g., the number of research papers in the field of meta-heuristics with the 
number of implementations of such heuristics in real, large-scale application systems. In a statistical 
review of fiowshop scheduling research, Reisman [20] criticizes the lack of relevance to practice for 
the overall majority of research in this field. He emphasizes “that fiowshop scheduling research is 
in dire need of a paradigm shift to enhance its probability of ever becoming a tool for the practice 
of OR/MS. [. . . ] The literature should also seek out and recognize work that is satisfied with less 
elegant but implement able, better yet implemented, solutions.” That is, in practice we need easy- 
to-use application systems that incorporate the results of basic research in the corresponding fields. 
Following Nievergelt [18], this may be subsumed by “No systems, no impact!” Therefore, we also have 
to deal with the issue of efficiently building such implementations to bridge the gap into practice, 
instead of just investigating and solving simplified problems by ad-hoc-approaches. 

Why have we not seen extensive applications of advanced heuristic search methods in practice? Our 
ultimate goal is decision support, so we have to observe the characteristics of real-world problems. 
Problems in practice are diverse and evolve over time, so robust and fiexible heuristic strategies are 
required. The common incorporation of advanced heuristics in application systems requires to find 
a way to reuse much of such software and to redo as little as possible each time. However, there 
is little support for the development of corresponding software systems. Thus, one should strive to 
develop efficient mechanisms to aid in building software that implements robust heuristics to solve 
real-world problems. This especially concerns the development of reusable and adaptable software 
components for heuristic search. As an example, meta-heuristics like tabu search or adaptive memory 
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programming [10] have been successfully applied to a great variety of problems. In order to apply 
such methods for a new type of problem, the corresponding algorithms have to be “materialized” 
as executable implementations — a non-trivial task if this is done every time starting from scratch, 
especially if carried out by someone who is not deeply involved in these topics. The crucial problem 
is the transition from high level descriptions of algorithms (like those in scientific papers) to specific 
systems. One way to achieve this step would be the definition and use of a modeling language as, e.g., 
for mathematical optimization (e.g., AMPL). Such modeling systems provide an excellant interface 
between models and solvers, but only for a limited range of model types (e.g., linear) due, in part, 
to limitations imposed by the solvers. Furthermore, while modeling systems for heuristic search is an 
active research topic, it is still open whether such an approach may be generally successful [17]. 

One might refiect the corresponding situation for the mathematical programming approach, which 
is based on the declarative specification of problems as well-defined mathematical programs. This 
facilitates the use of highly advanced methods that are to a large extent independent from specific 
types of problems. Another example is constraint logic programming with logical constraints as a 
common representation. Such well-defined representation schemes allow independent development of 
problem solving methods that are applicable for different types of problems. With respect to software 
development, this enables the development and use of software modules that work without adapta- 
tion for a wide range of applications. Local search based heuristics use as common “representation” 
primarily the concept of solution spaces and neighborhood structures. With this, different heuris- 
tics can be described on a high level like usually used in scientific papers. Whereas mathematical 
programming software defines the requirement to specify problems in an explicit low-level formal- 
ism (like text files in a specific format as, e.g., MPS-files), the neighborhood search paradigm is a 
more abstract concept. Here, the search space is described implicitly — the search space is “virtually 
available” by traversing the solution space locally, step-by-step. Unfortunately, the way solutions and 
neighborhoods are defined for various types of problems is generally highly problem dependent. 

2. Software reuse and frameworks 

In this paper, we focus on building application-specific heuristic search systems by means of the reuse 
and adaptation of sufficiently universal and generic software components for heuristic optimization. 
Ideally, software might be built — analogously to the production of physical goods — by selecting, 
adapting, and combining appropriate components from a huge set of existing components. This goal 
has been discussed for a long time [14, 15]. Reusability may be defined as the ability of software 
elements to serve for the construction of many different applications [16]. Corresponding reusable 
software elements may be termed components. The need for reusability comes from the observation 
that software systems often follow similar patterns; it should be possible to exploit these similari- 
ties and avoid reinventing solutions to problems that have been encountered before. By capturing 
such a pattern, a reusable software component will be applicable to many different developments. 
Due to the unreasonable assumption that we may generally use these components “as is” , it is cru- 
cial to define appropriate mechanisms for adaptation. Thus, our goal should be to incorporate the 
knowledge of experts in different fields into corresponding software components, which may easily 
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be reused and adapted without the need to understand their inner workings. The application of 
modern software development techniques has been neglected for a long time in the field of algorithm 
research. Accordingly, the software developed mostly lacks certain properties required to be reused, 
e.g., robustness, adaptability, and ease of use. Object-oriented software technology provides a way to 
build reusable software, but does not guarantee reusability per se. Especially the reuse of algorithms 
may be regarded as “Still a Challenge to Object-Oriented Programming” [23]. 

One approach primarily devoted to the goal of achieving a higher degree of reuse is the framework 
approach [3, 4, 13]. Taking into account that for the development of application systems for given 
domains quite similar software is needed, it is reasonable to implement such common aspects by 
a generic design and embedded reusable software components. Here, one assumes that reuse on a 
large scale cannot only be based on individual components, but there has to be to a certain extent a 
reuse of design. Thus, the components have to be embedded in a corresponding architecture, which 
defines the collaboration between the components. Furthermore, there must be explicit mechanisms 
for adaptations of these components to the specific application. 

A framework may be defined as a set of classes that embody an abstract design for solutions to 
a family of related problems (here, heuristics for discrete optimization problems). A framework 
provides us with abstract applications in a particular domain, that may be tailored for individual 
applications. A framework defines in some way a definition of a reference application architecture 
(“skeleton”). That is, frameworks provide not only reusable software elements but also some type 
of reuse of architecture and design patterns, which may simplify software development considerably. 
Thus, frameworks represent implementation-oriented generic models for specific domains. 

The flexibility of a framework is dependent on the specific possibilities for adaptation. As certain 
aspects of the framework application cannot be anticipated, these aspects have to be kept flexible, 
which implies a deliberate incompleteness of respective generic software components. These semi- 
finished components provide “holes” to be “filled” when building specific applications. Free [19] 
terms these variable aspects hot spots and calls a framework well- designed if it provides adequate 
hot spots for adaptations. Here, one may differentiate between white-box and black-box reuse or 
frameworks. Frameworks often start as white-box (inheritance-based) frameworks which are primarily 
customized and reused through subclassing. When the variable concepts have stabilized or been 
realized, it may be possible and appropriate to evolve to black-box frameworks, which are mostly 
adapted through static parameterization and composition [21]. There are several potential advantages 
of this adaptation by parameterization: run-time efficiency, decoupling of components, and ease-of- 
use without the need to understand internal details of components. One potential disadvantage of 
parameterizable components is the requirement of a solid understanding of the stable and flexible 
concepts of the domain [3]. In reality, of course, a framework generally does not embody a pure design 
concerning the properties described above, but uses different concepts and techniques in a pragmatic 
way. 
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3. Design of a framework for heuristic search 

While there are some well-known approaches for reusable software in the field of exact optimization,^ 
there is, as far as we know, no ready-to-use and well-documented component library in the field of 
local search based heuristics that meets the goals set above. However, we would like to point to the 
class library for heuristic optimization of [25], which includes both local search based methods and 
genetic algorithms, the framework for local search heuristics of [1], and the class library described in 
[9]. For reasons of space, an appropriate evaluation of the different approaches is not possible here. 

3.1 Domain analysis 

Building a non-trivial software system requires a solid understanding of the respective domain. Thus, 
it is essential to perform an appropriate domain analysis, which results in a model of the domain to 
be covered by reusable software with the constant and variable aspects of the domain. The domain 
to be captured here may be described in the following simplified, tight model for local search based 
heuristics. This model might appear rather trivial, but it is important to formalize the domain to be 
covered by reusable software in some way, to achieve a common understanding of the domain, which 
allows the software developers to start from reasonable assumptions. Here, we apply a “back-to-the- 
basics” approach and try to include the basic ingredients of the different concepts and neglect some 
subtleties, especially if these are difficult to describe within the space limitation here. Further, here 
we restrict the discussion to concepts related to the meta-heuristics simulated annealing and tabu 
search.^ 

There are different types of discrete optimizations problems, P, each with different problem instances, 
p e P. There may be different ways to obtain these problem instances; e.g., reading from a file, reading 
from a data base, using a problem generator, by coupling with a software system that provides the 
data, etc. For every problem instance p there may exist one or more solution spaces, 5p, with solutions, 
s e Sp. If unambiguous we may neglect the index p. As a “solution” for a given type of problem may 
be an incomplete solution in the sense that not all decision variables are set, we may use more than 
one solution space. For example, a solution for the Steiner problem in graphs may be constituted by 
the actual Steiner tree or by a selected subset of the Steiner nodes that may be treated as required 
nodes. Furthermore, we may or may not include infeasible solutions in solution spaces S. Infeasibility 
generally affects the evaluation function, f : S T. A fully ordered set F corresponds to a simple 
preference function of a decision maker. The objective is to find a solution s 6 5 with minimal 
evaluation /(s), or an evaluation as low as possible. (Of course, maximization problems may easily 
be transformed into minimization problems.) 

For each solution space, 5, there are one or more neighborhood structures, N, that define for each 
solution s e 5 a linear ordered set of neighbors N{s) = {rii(s), . . . , n|iv(5)|(5)}. To each neighbor 
n(s) € N{s) corresponds a move /x(s, n{s)) that captures the transitional information from s to n{s). 

^Consider, e.g., mathematical programming software like CPLEX (http://www.cplex.com), ABACUS, a framework 
that incorporates cutting plane and column generation methods for combinatorial optimization problems (cf. [22]), 
and class-libraries for constraint logic programming such as ILOG Solver (http://www.ilog.com/). 

^With regard to these heuristic methods, we refer to the descriptions given in [12, 10, 5]. 
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To make use of tabu search, one must store information about the search history (trajectory), which 
requires the notion of move (solution) attributes. A move /x is an aggregation of 

|/i”| negative move attributes /if, ... , /if € ^ 

and 

l/i'^l positive move attributes /ii", . . . , /^jji+|, /i^" € ^, 

where ^ is the set of attributes that imply, and are implied by, the neighborhood. The negative 
(positive) move attributes correspond to those properties of the current solution that are “destroyed” 
(“created”) when the move under consideration is performed. With this, tabu criteria may be defined 
in a fiexible way. 

3.2 Framework design 

In the following we briefly describe a framework for heuristic search (first discussed in [8] under 
the acronym HotFrame, Heuristic OpTimization FRAMEwork). Intended users of the framework 
are both software developers that aim to develop optimization software to be used in practice and 
researchers in the various fields applying meta-heuristics. Before describing the framework in more 
detail, a discussion of the general concept is given. We use abstraction as the primary mechanism 
to build adaptable components. That is, common behavior is factored out and grouped in generic 
classes, applying static type variation that enables one to keep respective concerns flexible. A generic 
component should be expressed in a way that assumes as little as possible about the specific type 
on which it is acting. The specific application, i.e. adaptation, of generic classes by defining (fixing) 
the type parameters is called instantiation. For example, a local search component may be parame- 
terized by the neighborhood used. Specific neigborhood types, which may be used to instantiate the 
component as a specific application, must meet a number of requirements defined be the generic com- 
ponent. This may be called conceptual inheritance: all type parameters of the component represent 
specializations of such an abstract type with respective syntactical and semantical requirements. 

In C++, generic classes are defined by using the template mechanism (sometimes termed as static 
polymorphism), which enables one to leave certain types and values unspecified until the code is 
actually instantiated and used (compiled). With this, in connection with traditional object-oriented 
mechanisms such as encapsulation, etc., adaptation of concerns such as the representation of data, 
the implementation of algorithms, and the tabu criterion used is possible. 

Naturally, this means that the basic data types on the low level have to be parameterized. Denoting 
such a type parameter by adding it enclosed by “<. . . >” to the respective classes may, e.g., result 
in a template class such as X_P<T, Range> that represents problem instances for a specific problem 
type A, with the type T defining the data type of the cost coefficients of the problem (e.g., integer 
or floating point) and the type Range defining the maximum number of objects. However, more 
important is the parameterization of the heuristic classes by problem dependent types. For steepest 
descent, this leads to two important type parameters: the solution type and the neighborhood type. 
For example, the instantiation of a generic steepest descent component for a sequencing problem 
with a shift neighborhood may be defined as 
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SteepestDescent< Sequencing< T, Range >» 

ShiftNeigborhood< T, Range > > . 

As a more advanced example, consider using an object-oriented data base to persistently store the 
search trajectory of a reactive tabu search component, with simulated annealing used as escape 
mechanism. Then, we may instantiate a respective component as 

TabuSearch< Sequencing< T, Range >, 

ShiftNeigborhood< T, Range >, 

React iveTabuCr it erion< 

Sequencing< T, Range >, 

ShiftNeigborhood< T, Range >, 

EdgeOrientedMoveAttributes< T, Range >, 

ExactSolutionInfo< T, Range >, 

PersistentTrajectory< T, Range > >, 

SimulatedAnnealing<Sequencing< T, Range >, 

ShiftNeigborhood< T, Range > > > . 

Summarizing, the approach followed here is to extensively use static polymorphism in this way. Thus, 
the framework is primarily based on generalization by genericity (by heavy use of templates), and 
uses inheritance only in a secondary way. Potential advantages of this design approach are run-time 
efficiency, a decoupling of components, less “intrusiveness” , and possibly a better way to reuse more 
granular components, what might lead to appropriate flexibility, while partially achieving the ease 
of use of black-box reuse. For example, the architectural design described below provides appropri- 
ate flexibility on the problem side to enable reasonable implementation variations. Inheritance is 
essentially used to factor out common behavior for similar problems. For example, the similarities 
between different sequencing problems and corresponding neigborhood structures can be represented 
in corresponding class hierarchies. With that, the internal representation of a sequence or different 
neighborhoods (shift, swap, inversion, etc.) are coded only once, whereas specific functionality such 
as the evaluation of a solution is implemented in corresponding subclasses. 

The architectural design covers especially the separation into components, and the definition of a pre- 
defined component collaboration including the syntatical interface and the semantical requirements 
to be met. Naturally, our principal problem is, how to separate the problem side from heuristics. If 
it is possible to define a universal interface between these concepts, the number of implementations 
needed is reduced to an additive (instead of a multiplicative) combination of the number of problems 
and the number of heuristics. According to the domain model discussed above, the abstract problem- 
specific types involved from a specification perspective (with X representing the problem type) would 
be a problem class X_P and one or more solution classes X.S for all local search based heuristics, 
and eventually some additional classes for advanced heuristics. For example, various tabu search 
heuristics may require for each solution class X_5 one or more corresponding neighborhood classes 
X_5JV, for each neighborhood class JA_5JV one or more classes XJSJi, which encapsulate the tran- 
sitional information of a move /x as corresponding move attributes, and possibly for each solution 
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class XJ5 corresponding classes XJSJ, which encapsulate information about solutions (e.g., exact 
or hash codes). These types are visualized by the simplified class diagram in Figure 1, which includes 
the relevant associations and the most important functionality to be provided by the corresponding 
types (as specialized methods). 




Figure 1: Class diagram of problem-specific types. (M denotes an aggregation of class-specific type 
parameters.) 

As an example, we briefly discuss the design of the neighborhood iterator classes, which constitute the 
crucial dependency between the solution classes and heuristic classes. The applied concept is similar 
to the iterator concept that has been successfully applied to separate objects in containers from 
algorithms that operate on this data. What functionality has to be provided by such a class? First, 
we need a way to construct such a neighborhood iterator instance that represents an appropriately 
defined first neighbor for a given solution. Then, we need a way to iterate to the next neighbor, and 
to check whether we have reached the last neighbor. Last, we require the evaluation of every such 
neighbor. The neighbors are, of course, only constructed virtually by a reference to the solution and 
the storage of the transitional information. 

Following the design decision discussed above, we have to realize the type variation for the conceptual 
types by parameterizing the heuristic components by the problem-specific aspects as follows: 

• SteepestDescent< 5, 5JV > 

• SimulatedAnnealing< 5, 5 JV, AcceptanceF unction, Cooling > 
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• TabuSearch< 5, 5 JV, TabuMemory > 

The tabu memory parameterization is dependent on the criterion used. Referring to [5] for a more 
detained description of the tabu criteria considered here, the static tabu criterion is based on move 
information (an appropriately defined inversion of performed moves is prohibited for some time), the 
basic idea of the strict tabu criterion is to provide necessity and sufficiency with respect to the idea 
of not revisiting any previously visited solution, while reactive tabu search combines these two ideas 
by adapting the tabu list length of static tabu search exploiting trajectory information. 

• StaticTabuCriterion< 5, S' JV, > 

• StrictTabuCriterion< S, SJV, SJ> 

• ReactiveTabuCriterion< S, SJV, S-A, SJ > 

With such a kind of parameterization, we may reuse the heuristic components just by changing 
the appropriate types, without having to know about the internal structures. It is important to 
emphasize, that all components should, if possible, provide appropriate default behavior. For example, 
the simulated annealing component should be callable without setting any parameters. Then, the 
parameters are automatically set to appropriate default values (self-adaptation). 

The two primary concerns that are reused here, are, first, the different heuristics incorporated, but, 
equally important, the predefined component collaboration between different components, especially 
the interface between heuristics and different problem-specific components. As the framework users — 
the developers of application-specific software — may orient on the framework architecture, they are 
freed from some of the harder tasks; ideally, one must only add some basic, well-defined functionality. 
Thus far, the framework has been successfully applied for several problems, e.g., the Steiner tree 
problem in graphs, the multiprocessor scheduling problem, the multi-constrained (multi-dimensional) 
knapsack problem, pattern sequencing problems [6], continuous fiow-shop scheduling problems [7], a 
ring network design problem [5], and ring load balancing problems. The application of the framework 
to different types of problems serves three primary goals. First, it provides a test of the correctness 
and applicability of the framework, especially with respect to design changes. Then, the applications 
serve as examples of how to adapt and reuse the components of the framework. Last, some results 
of computational experiments may be of interest by themselves. By using such a framework, we get 
a way to do simple and fair comparisons of different heuristics. One may use the same application 
skeleton and just change the strategies (i.e., the components) to be compared. This meets some of 
the criticism given in [2, 11], and allows one to do controlled, unbiased experimentation. 

4. Conclusions and future directions 

Designing and implementing reusable software components is possible. Building a component library 
and reusing and adapting these components to specific applications is not effortless (“Nothing is 
for free”). Nevertheless, this approach may be regarded as the primary way to enable the broad 
application of the methods under consideration in practice. It is true though, that we disregarded 
some further practical requirements, such as the integration into superordinate application systems 
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in connection with, e.g., mechanisms that allow the user to intervene in the search process. However, 
such functionality may be included in the component library without invalidating the general design 
approaches presented here. 

We should note, that even if we have reusable software components for heuristic search, the success 
of respective implementations still depends on appropriate definitions of the concepts for the specific 
problem. That is, reusable components support the development of specific heuristic methods, where 
the success generally depends on the meaningful definition of several concerns and the efficient 
implementation of the problem-specific parts. 

Future research should include extensions in various respects, e.g., new heuristics, parallel algorithms, 
environment for experimentation, interfaces according to component architectures like OMG-CORBA 
and MS-COM+, documentation, and experimentation of/with the concepts mentioned and discussed 
above. This should also include the application of novel concepts from the software engineering field 
such as generative programming, which may simplify the application of reusable software components; 
in this context, declarative approaches deserve attention, too. 
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Ein verteilt-paralleles Tabu Search- Verfahren fur 
Containerbeladeprobleme mit schwach heterogenem 

Kistenvorrat 

Prof. Dr. Hermann Gehring, FernUniversitat Hagen 
Dr. Andreas Bortfeldt, FernUniversitat Hagen 

Zusammenfassung: Gegenstand des Beitrags ist die Entwicklung eines verteilt-parallelen Tabu 

Search- Verfahrens (TSV) fiir Containerbeladeprobleme mit schwach heterogenem Kistenvorrat. Das 
Verfahren basiert auf dem Konzept der kooperativen Autonomie, d.h. es kooperieren mehrere au- 
tonome Prozesse bei der Problembearbeitung. Jeder der Prozesse realisiert ein sequentielles TSV 
zur Losung des betrachteten Beladeproblems; die Prozesse kooperieren durch den Austausch von 
Losungen. Die Qualitat des verteilt-parallelen TSV wird anhand einiger Referenzprobleme aus 
der Liter atur getestet, wobei Containerbelade verfahren verschiedener Autoren zu Vergleichszwecken 
herangezogen werden. 

1. Einfiihrung und Problemformulierung 

Im Bereich des gewerblichen Giiterverkehrs kann der Einsatz von standardisierten Transportmitteln 
wie Container und Paletten durch computergestiitzte Beladeverfahren wirksam unterstiitzt werden. 
Betrachtet wird hier das Problem der Beladung eines Containers bei schwach heterogenem Kistenvor- 
rat, d.h. einem Kistenvorrat, der sich durch eine relativ geringe Anzahl verschiedener Kistentypen 
und eine relativ grofie Anzahl von Kisten pro Kistentyp auszeichnet (vgl. hierzu [3], [7]). Dieses 
Beladeproblem lafit sich kurz wie folgt formulieren: 

Gegeben seien ein einzelner Container und eine Menge dreidimensionaler rechteckiger Giiter, nach- 
folgend als Kisten bezeichnet. Die Menge der Kisten sei schwach heterogen. Zu bestimmen ist eine 
zulassige Anordnung einer Teilmenge aller Kisten in dem Container derart, dafi das verst ante Kisten- 
volumen maximiert wird und samtliche Restriktionen fiir die Beladung eingehalten werden. 

Eine Kistenanordnung wird zul^sig genannt, falls alle verst auten Kisten vollstandig und ohne 
Uberlappung in dem Container plaziert sind und falls sie parallel zu den Containerwanden liegen. 
Von der Vielzahl der in der Praxis auftretenden Restriktionen der Containerbeladung werden die 
Stabilitats-, die Orientierungs-, die Uberstapelungs- und die Gewichtsrestriktion (vgl. hierzu [2], [3], 
[5], [7]) in die Problemformulierung einbezogen. 

Fiir Containerbeladeprobleme wurden in den vergangenen Jahren verschiedene sequentielle Losungs- 
ansatze entwickelt, so u.a. problemspezifische Heuristiken (vgl. [1], [2], [8], [10]), Graphensuchver- 
fahren (vgl. [3], [9]), genetische Algorithmen (vgl. [3], [7]) und Tabu Search- Verfahren (vgl. [4], [5]). 
liber die Anwendung von Parallelverfahren auf diese Problemklasse wurde in der gesichteten Literatur 
bislang nicht berichtet. Im vorliegenden Beitrag wird daher ein Parallelansatz zur Losung des Con- 
tainerbeladeproblems mit schwach heterogenem Kistenvorrat vorgestellt. Der Ansatz umfafit mehrere 
autonome Tabu Search- Verfahren (TSV), die durch den Austausch von Losungen kooperieren. Diese 
autonomen TSV stellen unterschiedliche Konfigurierungen eines von den Autoren an anderer Stelle 
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beschriebenen sequentiellen Tabu Search- Verfahrens fiir Containerbeladeprobleme dar (vgl. [4], [5]). 
Der Rest des Beitrags ist wie folgt organisiert: In Abschnitt 2 wird die Konfigurierung des genannten 
sequentiellen TSV und in Abschnitt 3 der aus mehreren kooperierenden sequentiellen TSV beste- 
henden Parallelansatz beschrieben. Die Qualitat des Parallelansatzes wird in Abschnitt 4 anhand 
einiger Referenzprobleme aus der Literatur demonstriert. 



2. Sequent ielles Tabu Search- Verfahren 

Um unterschiedliche Konfigurierungen des dem Parallelansatz zugrundliegenden sequentiellen TSV 
darstellen zu konnen, werden zunachst einige grundlegenden Eigenschaften des sequentiellen TSV 
beschrieben. Das sequentielle TSV besteht aus einer sogenannten Basisheuristik, die im Rahmen 
einer iibergeordneten Tabu Suche zur Erzeugung von Losungen des Containerbeladeproblems einge- 
setzt wird. Die Basisheuristik beladt einen Container Schritt fiir Schritt. In jedem Schritt wird ein 
Packraum, am Anfang der Container, mit einer lokalen Anordnung befiillt. Lokale Anordnungen 
stellen einfache Arrangements von Kisten dar. Fiir einen Packraum existieren in der Regel viele 
zul^sige Anordnungen, die in einer Anordnungsliste absteigend nach einer gewissen Giitebewertung 
vorgehalten werden. Nach dem Fiillen eines Packraums wird der nicht genutzte Teil des Packraums 
vollstandig in mehrere residuale Packraume zerlegt; letztere werden je in einem weiteren Schritt 
befiillt. Im Rahmen des iibergeordneten Tabu Search- Verfahrens wird die Basisheuristik zur Erzeu- 
gung von Nachbarschaftslosungen eingesetzt. Eine zulassige Losung wird hierbei durch einen Vektor 
s, genannt Packplan, reprasentiert. Die ip-te {ip = 1,2,...) Position des Packplans ist dem zp-ten 
fiillbaren Packraum zugeordnet; ein Vektorelement s{ip) enthalt unter anderem einen ganzzahli- 
gen Anordnungsindex ia, welcher die Position der fiir den ip-ten Packraum zu benutzenden lokalen 
Anordnung in der zugehorigen Anordnungsliste identifiziert. Durch die Veranderung des Anord- 
nungsindex fiir genau eine Anderungsposition des Packplans gelangt man zu einem neuen Packplan; 
er repr^entiert eine Losung in der Nachbarschaft der urspriinglichen Losung. Demnach konnen 
durch unterschiedliche Vorschriften zur Auswahl von Anderungspositionen ip* und zur Modifikation 
der zugehorigen Anordnungsindizes s{ip*).ia verschiedene Nachbarschaften eines gegebenen Pack- 
plans s definiert werden. 

Nach diesen Vorbereitungen kann nun auf die Konfigurierung der sequentiellen TSV eingegangen 
werden. Unter dem Begriff der Verfahrenskonfigurierung werden hier die die Festlegung von Ver- 
fahrensparametern, die Wahl eines Diversifikationskonzeptes und die Vorgabe von Terminationskri- 
terien subsumiert. 

Zur Begrenzung des Suchaufwandes werden zwei Verfahrensparameter eingefiihrt. Der erste Para- 
meter Maxao beschrankt die Anzahl der pro Packraum zu beriicksichtigenden lokalen Anordnungen. 
Der zweite Parameter Nhlim beschrankt dariiberhinaus die zulassigen alternativen Anordnungs- 
indizes ia* fiir eine Anderungsposition ip* des Packplans wie folgt: 

- s{ip*).ia\ < nao{ip*) / Nblim^ (1) 
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wobei nao(ip*) die bei der Transformation des Packplans s ermittelte Lange der Anordnungsliste fiir 
die Position ip* bezeichnet. Die angegebene Ungleichung ist fiir Nblim = 1 stets erfiillt; fiir Nblim > 
1 beschrankt sie die zugelassenen alternativen Anordnungen ia* auf solche, die bewertungsmafiig in 
der Nahe der urspriinglichen Anordnung s{ip*).ia liegen. 

Neben den beiden genannten Parametern werden noch weitere vier Verfahrensparameter eingefiihrt: 

- Die Anzahl der zu berechnenden Iterationen Maxiter. 

- Die Nachbarschaftsstruktur Nbmodus mit den Auspragungen ’’grofi” und ”klein”. 

- Die Bewertungsvariante fiir lokale Anordnungen Evalmodus mit den Auspragungen ”min” und 
’’max”; wahrend die Auspragung ”min” nur ein Bewertungskriterium umfafit, kommen bei der 
Auspragung ’’max” noch zwei weitere Kriterien hinzu. 

- Die maximale Lange der Tabuliste Maxtabu. 

Zur Diversifikation des Verfahrens wird der gesamte Suchprozefi in Maxphasen verschiedene Such- 
phasen untergliedert, wobei die Suche in jeder Phase ausgehend von derselben Startlosung und mit 
leerer Tabuliste beginnt. Die Diversifikation wird erreicht, indem jeder Phase ein unabhangiger Satz 
der angegebenen sechs Verfahrensparameter zugrundegelegt wird. 

Die Termination des gesamten Verfahrens wird durch zwei alternative Terminationskriterien bewirkt: 
Die maximale Anzahl Maxphasen der zu durchlaufenden Suchphasen und die Rechenzeitschranke 
Maxzeit. Das Verfahren terminiert, sobald nach der vollstandigen Berechnung einer Iteration eines 
dieser beiden Terminationskriterien erfiillt ist. Als Verfahrenslosung wird die fiber alle Phasen beste 
Losung bestimmt. 



3. Verteilt-paralleles Tabu Search- Verfahren 

Fiir die Parallelisierung wurde ein eher grobkorniger Ansatz auf der Programm- bzw. Coarse Grain- 
Ebene gewahlt, dem das Modell der kooperativen Autonomie zugrunde liegt. Demnach umfafit der 
Ansatz mehrere autonome Prozesse, die durch den Austausch von Informationen kooperieren. Jeder 
der autonomen Prozesse wird einer Workstation in einem lokalen Netzwerk (LAN) zugewiesen. 

Bei der Losung eines gegebenen Beladeproblems kooperieren Nproz (Nproz > 1) Prozesse. Jeder 
Prozefi stellt eine Instanz des sequentiellen Tabu Search- Verfahrens dar und lost das komplette 
Problem. Allerdings werden die einzelnen Prozesse unterschiedlich konfiguriert, wobei jedoch alle 
Prozesse dieselbe Anzahl von Maxphasen Suchphasen durchlaufen. 

Jeweils am Ende einer Suchphase tritt ein Prozefi in eine Kommunikation mit den anderen Prozessen 
ein. Die Kommunikation umfafit pro Prozefi zwei Schritte: 

(1) Ein Prozefi stellt seine aktuelle Bestlosung, d.h. die beste wahrend der bisherigen Suche von 
dem Prozefi gefundene Losung, fiir die anderen Prozesse bereit. 
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(2) Der Prozefi ubernimmt eine Premdiosung, d.h. eine von einem anderen ProzeB bereitgestellte 
Losung, und beriicksichtigt diese Losung gegebenenfalls bei der weiteren Suche. 

Demnach vollzieht sich die Kooperation der Prozesse durch den Austausch von Losungen. Der 
Losungsaustausch erfolgt mittels eines Kommunikationsobjekts, welches Losungen der Prozesse auf- 
nimmt und den Prozessen zur Verfiigung stellt. Das Kommunikationsobjekt wird mittels einer Daten- 
bank realisiert. Seine Organisation hangt von dem verwendeten Kommunikationsmodell, d.h. der 
Art der Kommunikationsstruktur, ab. Als Kommunikationsmodell sind alternativ ein Ring und ein 
Blackboard vorgesehen (vgl. [13]). 

Im Falle des Kommunikationsmodells Ring wird die Datenbank des Kommunikationsobjektes in 
Nproz Bereiche eingeteilt. Jeder Bereich ist einem Prozefi fest zugeordnet und als Schlange gemafi 
dem FIFO-Prinzip organisiert. Ein Prozefi legt seine Losungen grundsatzlich nur in seinem Daten- 
bankbereich ab. Eine neue Losung wird somit stets am Ende des Bereichs angefiigt. Die Nproz 
Prozesse werden auf definierte Weise zu einem Ring verkniipft, so dafi jedem Prozefi genau ein 
Vorganger zugeordnet ist. Am Ende einer Suchphase liest daher ein Prozefi die im Datenbank- 
bereich seines Vorgangers aktuell am Schlangenanfang befindliche Losung, die nach dem Lesen aus 
der Schlange entfernt wird. 

Bei dem Kommunikationsmodell Blackboard wird die Datenbank des Kommunikationsobjektes als 
einheitlicher Bereich organisiert, in dem alle Prozesse ihre Losungen ablegen und aus dem sie Fremd- 
losungen entnehmen. Die in der Datenbank abgelegten Losungen werden als Stapel gemafi dem 
LIFO-Prinzip verwaltet. Eine von einem Prozefi bereitgestellte Losung wird also stets als oberstes 
Element in den Stapel eingefiigt. Ferner liest ein jeder Prozefi stets die oberste Losung der Daten- 
bank. Anders als bei einem Ring wird eine gelesene Losung nicht entfernt; sie bleibt vielmehr fiir 
den Zugriff durch andere Prozesse bis zum nachsten Schreibvorgang verfiigbar. Wegen der geringen 
Kommunikationshaufigkeit ist das bestandige Anwachsen des Stapels tolerierbar. Da die Prozesse im 
Laufe der Suche im allgemeinen Losungen von besserer Qualitat finden, ist es durchaus beabsichtigt, 
dafi nur die jeweils zuletzt eingefiigte Losung fiir Lesezugriffe verfiigbar ist. 

Unabhangig von dem verwendeten Kommunikationsmodell verfahrt ein Prozefi mit einer gelesenen 
Fremdlosung stets wie folgt: 

(1) Falls die Fremdlosung keinen besseren Zielfunktionswert aufweist als die aktuelle Bestlosung 
des Prozesses, so wird sie ignoriert und der Prozefi beginnt die Suche in der nachsten Phase 
erneut mit der Startlosung. 

(2) Andernfalls beginnt der Prozefi die Suche in der nachsten Phase mit der iibernommenen 
Fremdlosung, die somit die Startlosung ersetzt. 

Ein zusatzliches Merkmal des Parallelisierungsansatzes besteht darin, dafi einzelne Prozesse nicht 
an der phasenweise durchgefiihrten Kommunikation beteiligt werden. Ein solcher Prozefi fiihrt zwar 
analog zum sequentiellen Fall eine isolierte Suche durch, die von ihm ermittelte Bestlosung wird 
abschliefiend dennoch in die Ermittlung der Verfahrenslosung einbezogen. Die Verfahrenslosung wird 
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nach der Beendigung der Suche durch alle Prozesse stets als die beste von der gesamten ProzeBgruppe 
gefundene Losung ermittelt, unabhangig davon, ob etwa einzelne Prozesse von der Kommunikation 
ausgeschlossen wurden. 

Die Konfiguration des verteilt-parallelen Tabu Search- Verfahrens erfordert, iiber die Konfigurierung 
gemafi Abschnitt 2 hinaus, die folgenden Festlegungen: 

- Die Definition der Anzahl Nproz aller am Verfahren beteiligten Prozesse einschliefilich der 
event uell nicht an der Kommunikation teilnehmenden Prozesse. 

- Da Festlegung, ob und welche Prozesse von der Kommunikation ausgeschlossen werden. 

- Die Wahl des Kommunikationsmodells, Ring oder Blackboard; ferner im Falle eines Rings die 
Definition der konkreten Ringstruktur. 

4. Berechnungsergebnisse 

Das sequentielle TSV und das verteilt-parallele TSV wurden in C implementiert. Zur Durchfiihrung 
der Prozefikommunikation wurde das Datenbanksystem BTRIEVE verwendet. Als LAN- Work- 
stations standen Pentium Personalcomputer mit einer Taktfrequenz von 200 MHz zur Verfiigung. 

Um die mit dem verteilt-parallelen TSV erreichbare Losungsqualitat zu priifen, wurden bekannte 
Referenzprobleme aus der Literatur auch unter Einbeziehung anderer Verfahren fiir Containerbe- 
ladeprobleme berechnet. Bei den Referenzproblemen handelt es sich um die 700 Testprobleme von 
Bischoff und Ratcliff [2], die in 7 Testfalle zu je 100 Problemen unterteilt sind. Bei diesen 7 Testfallen 
variiert der Charakter der Kistenmengen von schwach heterogen bis stark heterogen. Jedes der 700 
Probleme schliefit lediglich eine Orientierungsrestriktion ein. Angemerkt sei in diesem Kontext, dafi 
unabhangig von dem zu losenden Problem das TSV stets eine Stabilitatsrestriktion beriicksichtigt. 

In einigen Experiment en von begrenztem Umfang wurden Anhaltspunkte fiir die Verfahrenspara- 
meter gewonnen. Gewahlt wurde folgende konkrete Konfigurierung: 

- Anzahl der am Verfahren beteiligten Prozesse Nproz = 4. 

- Unabhangig vom Kommunikationsmodell werden nur die ersten drei Prozesse in die Kommu- 
nikation einbezogen, wahrend der vierte Prozefi aufierhalb der Kommunikation mitlauft. 

- Die Kommunikationsmodelle Ring und Blackboard werden alternativ benutzt; im Falle des 
Rings bilden die ersten drei Prozesse aufsteigend nach den Prozefinummern geordnet eine 
Ringstruktur; der Vorganger des Prozesses 1 ist daher der Prozefi 3. 

- Die Werte der Parameter, die fiir alle vier Prozesse jeweils identisch gewahlt wurden, gehen 
aus Tabelle 1 hervor. 

- Eine Differenzierung der Prozefikonfigurationen wird ausschliefilich mittels des Parameters 
Nblim vorgenommen; dieser erhalt die Werte Nblim = 2, 3,4, 3 fiir die Prozesse 1, 2, 3, 4. 
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Tab. 1: Gemeinsame Parameterwerte der Prozesse des parallel- verteilten TSV. 



Pha- 


Phasenbezogene Parameter 


Phaseniibergreifende Parameter 


sen 


Maxiter 


Nbmodus 


Evalmodus 


Maxao 


Maxtabu 


Maxphasen 


Maxzeit [sec] 


1 


50 


klein 


max 


100 


50 


4 


500 


2 


50 


klein 


min 


100 


50 


4 


500 


3 


10 


grofi 


max 


50 


10 


4 


500 


4 


10 


grofi 


min 


50 


10 


4 


500 



Fiir die 700 Testprobleme von BischofF und Ratcliff liegen Berechnungsergebnisse fiir die Heuristiken 
von Bischoff et al. [1], BischofF und Ratcliff [2] sowie den genetischen Algorithmus (GA) von Gehring 
und Bortfeldt [7] in der Literatur vor. Sie sind zusammen mit den Berechnungsergebnissen fiir das 
sequentielle TSV und das verteilt-parallele TSV in Tabelle 2 aufgefiihrt. Als Berechnungsergebnisse 
des sequentiellen TSV wurden die mit dem Prozefi 4 ermittelten Resultate verwendet. Pro Testfall 
wird jeweils in Klammern die Anzahl der Kistentypen fiir die Probleme des Testfalls angegeben. 



Tab. 2: Berechnungsergebnisse fiir die 700 Testprobleme von Bischoff und Ratcliff. 



Testfall 


Bischoff 
et al. 


Bischoff k 
Ratcliff 


Gehring & 
Bortfeldt 
(GA) 


Bortfeldt k 
Gehring 
(TSV) 


Gehring k 
Bortfeldt 
(Par. TSV/ 
Ring) 


Gehring k 
Bortfeldt 
(Par. TSV/ 
Blackboard) 




Volumen- 


Volumen- 


Volumen- 


Volumen- 


Volumen- 


Volumen- 




auslastung 


auslastung 


auslastung 


auslastung 


auslastung 


auslastung 




(in %) 


(in %) 


(in %) 


(in %) 


(in %) 


(in %) 


BRl (3) 




83,79 






89,73 




BR2 (5) 




84,44 


■ilalMI 




89,58 


89,62 












89,21 


89,26 














88,42 














IIIIIIIIIII^ESIII^ 




















82,01 










Mittel 


81,97 


83,45 


87,51 


87,07 


88,09 


88,17 



Mit dem Kommunikationsmodell Blackboard werden, wie die Tab. 2 zeigt, etwas bessere Resultate 
erzielt als mit dem Ringmodell. Die nachfolgenden Analysen beziehen sich daher auf die Berechnun- 
gen mit dem Blackboardmodell. 

Fiir alle 7 Testfalle erzielt das verteilt-parallele TSV deutlich hohere Volumenauslastungen als die 
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beiden Heuristiken von Bischoff et al. [1] bzw. BischofF und Ratcliff [2]. Gemittelt fiber alle 700 Prob- 
leme betragt die absolute Verbesserung 6,20 % bzw. 4,72 % des Containervolumens. Im Vergleich 
zu dem sequentiellen TSV wird gemittelt fiber alle 700 Probleme eine Auslastungsverbesserung von 
1,1 % des Containervolumens erreicht. Schliefilich schneidet das verteilt-parallele TSV anders als 
der sequentielle TSV im Mittel fiber alle 700 Probleme auch besser ab als der GA von Gehring und 
Bortfeldt [7]. Allerdings betragt jetzt die Auslastungsverbesserung nur noch 0,66 % des Container- 
volumens. Zudem ffihrt der GA bei den letzten drei Testfallen mit starker heterogenem Kistenvorrat 
zu besseren Ergebnissen als das verteilt-parallele TSV. Dies ist eigentlich nicht fiberraschend, da 
das TSV speziell ffir schwach heterogene Kistenmengen konzipiert wurde; nur in diesem Fall stehen 
hinreichende Kistenanzahlen pro Kistentyp ffir die Bildung von effizienten lokalen Anordnungen zur 
Verffigung. 

Von Interesse ist die Prage, warum das verteilt-parallele TSV im Vergleich zu dem sequentiellen TSV 
nicht wesentlich besser abschneidet. Von verschiedenen Autoren (vgl. z.B. Nissen [11], Crainic et 
al. [6]) wurde darauf hingewiesen, dafi sich die Parallelisierung von TSV zum Zweck der Steigerung 
der Ergebnisqualitat als schwieriger erweist, als die den gleichen Zweck verfolgende Parallelisierung 
von GA. Dies erscheint zumindest dann plausibel, wenn, wie im vorliegenden Fall, eine Kommu- 
nikation zwischen Prozessen den Austausch von Losungen und deren Nutzung als Ausgangspunkte 
der weiteren Suche vorsieht. Denn ein allzuhaufiges ’’Springen” im Suchraum kann einer intensiven 
Erkundung begrenzter Bereiche des Suchraums entgegenwirken. Wird andererseits, wie hier, die 
Kommunikation stark begrenzt, so kann keine grofiere Verbesserung der Losungsqualitat gegenfiber 
der sequentiellen Suche erwartet werden. Dies gilt um so mehr, wenn, wie im vorliegenden Fall, die 
durch ein sequent ielles TSV erzeugten Losungen bereits von relativ hoher Qualitat sind. 

Crainic et al. [6] berichten gar, dafi die besten Resultate bei der Parallelisierung eines TSV ffir 
ein Mehrgfiterstandortproblem erreicht wurden, als auf eine Kommunikation der Prozesse vollig 
verzichtet wurde. Die hier gewonnenen Erfahrungen gehen nicht ganz so weit: Die Verbesserung 
der Ergebnisse mit dem verteilt-parallelen TSV im Vergleich zu dem sequentiellen TSV um 1,1 % 
wurde primar durch die unterschiedliche Prozefikonfigurierung erzielt. Synergieeffekte durch die Pa- 
rallelisierung des TSV spielten zwar eine sekundare Rolle, waren aber immerhin in geringem Umfang 
feststellbar. 
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Summary: We present a new approach to the numerical computation of (higher) moments of 

the exit time distribution of diffusion processes. The method relies on a linear programming (LP) 
formulation of a process exiting from a bounded domain. The LP formulation characterizes the 
evolution of the stopped process through the moments of the induced occupation measure and leads 
to an infinite dimensional LP problem on a space of measures. Our functional method yields finite 
dimensional linear programs which approximate the infinite dimensional problem. An important 
aspect of our approach is the fact that excellent software is readily available. 

We shall illustrate the method by looking at some examples. We shall specifically report on numerical 
computations and results related to change-point detection methods for drifted Brownian motion. 
Procedures for detecting changes in these models, such us CUSUM, EWMA and the Roberts-Shiryaev 
procedure, can be described in terms of one-dimensional diffusion processes and lead to exit time 
problems. Important operating characteristics for all of these procedures are the mean exit time and 
higher moments of the exit distributions of the diffusion processes. 



1. Introduction. In this paper we describe the main ideas of a numerical method for computing 
the moments of exit time distributions of diffusion processes. An interesting feature of the method 
for the OR community is the fact that it is based on linear programming. We shall illustrate the 
method by looking at two examples in greater detail. We shall specifically report on numerical 
results related to geometric Brownian motion, a process important in mathematical finance, and to 
the Roberts-Shiryaev process, a process important in the context of change-point detection. 

The method is a specialized version of the linear programming formulation for controlled Markov 
processes which, for the discrete setting, was developed by Manne [9], and extended to very general 
settings by e.g. Bhatt and Borkar [2], Kurtz & Stockbridge [8], Hernandez- Lerma et al. [7] and others. 
The key observation which underlies the LP-formulation is that the evolution of a very lg,rge class of 
Markov processes can be described in terms of a system of equations which specify the occupation 
measure of the process. For controlled Markov processes this characterization of the evolution of 
the process leads to an infinite dimensional LP-problem, the feasible points being measures on the 
product space of the set of states (of the process) and the set of possible control values. For the 
special case of exit problems for (uncontrolled) diffusions this approach reduces to the identification 
of a feasible point. In the case of “no-control” the objective function of the LP problem is just an 
“auxiliary” tool. 

The main characteristics of the method are (1) to describe (and approximate) occupation measures 
through its moments, (2) to characterize feasible moment sequences for exit time distributions from 
bounded domains by linear equations involving the generator of the process and a chosen class of 
test functions, and linear inequalities derived from the Hausdorff conditions for (moment) sequences 
of measures supported on the unit interval, the unit square, etc. (see e.g. [5] and [3]). 

The paper is organized as follows. In section 2 we give some theoretical results which are the 
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foundation of the method. In section 3 we illustrate the method by looking at geometric Brownian 
motion. We compare the numerical results based on the LP-approach with the exact values. In 
section 4 we analyze one of the detection schemes referred to above, the Roberts-Shiryaev process, in 
some detail. This process is closely related to geometric Brownian motion. Thus geometric Brownian 
motion can be used to calibrate the numerical method for computations concerning the R-S process. 



2. Theoretical results. In light of the examples to be discussed below and in order not to 
overburden the presentation with technicalities we restrict our discussion to exit problems of finite 
dimensional diffusions from a bounded interval. Due to translation and scaling properties of mono- 
mials in ]R^ we could assume w.l.o.g. the interval to be the hypercube. We shall further assume that 
all exit times to be considered have a finite mean value. The following result is thus an immediate 
consequence of the martingale characterization of diffusion processes, e.g. [4] or [11], the optional 
sampling theorem and the Helley-Bray theorem. 

Theorem 1. Let {Xt)t>o be a solution of the martingale problem for a diffusion operator A starting 
at xq e at time to > 0. Let r denote the first exit time of the time-space process {Xt,t)t>to from 
a bounded ‘‘interval” G = I x [0, T], I C Assume that E[r] < oo. Then there exist a finite 
measure po on G^, the interior of G, and a probability measure on dG, the boundary of G, such that 
for every function f : x SV' M which is on IR^ and on IR^ the adjoint equation 



f f{x,t)dni{t,x) - f{to,xo) - f %+’A.f 
JdG JG^ Ot 



{Xjt)dfjLo{x,t) = 0 



( 1 ) 



holds. The probability measure p,\ is the joint distribution of the exit location and the exit time of 
the process {Xt/^T,t AT) t>to, and po is its occupation measure. 

The converse to theorem 1 was proved in [8] in a very general setting. The following result is a 
special case of this general theorem. We keep on using the notation introduced above. 

Theorem 2. Let po be a finite measure on and let pL\ be a probability measure on dG which 
satisfy the adjoint equation (1). Then there exists a stochastic process {Yt)t>to starting at xq at time 
to which is adapted to a filtration of a -fields {F^} and an {Ft} -stopping time f such that for every 
test function f (see above) 

ftAf [ 9 / 

(.) [^+^/ 

(ii) p[{Y{f),f)€dG]=l, 

and 

(Hi) I (Vt,t) e dG,t < fj has Lebesgue measure zero. 

Moreover, for each Borel subset Tq C G^ and Fi C dG 



{Ys,s)ds in an [Ft] -martingale, 
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[jr"/r„(y„s)ds] 


= /lo(ro), 


(2) 


£?[/r.(y(f),f)] 


= Mi(ri). 


(3) 



The adjoint equation (1) specifies uncountably many constraints for the two measures fio and fii. 
For our numerical method we shall choose a finite set of test functions instead. To be specific, we 
characterize the measures /xq and /xi through their moments and therefore use monomials f{x,t) := 

Xi^ • • ni H- h rid = n > 0, m > 0 up to a fixed order M 6 iV. The reason for choosing 

monomials is that we are going to exploit the linear structure of the Hausdorff conditions (see below). 
For instance, if /xq is a finite measure on the unit interval and if M is chosen to be 3 we are going to 
work with sequences 2 : = {zq, Z 2 , zs) which satisfy the inequalities 



(i) > 0, 0 < i < 3, 

(ii) Zi — Zi+i >0, 0 < i < 2, 

(iii) Zi - 2zi+i + Zi+2 >0, 0 < i < 1, 

(iv) zo - 3zi + 3z2 - Z 3 > 0 . 




Figure 1: The set of Hausdorff sequences of order 3 

Figure 1 shows the set of such admissible sequences in if zq = 1, the case of approximating 
“moment sequences” of probability measures on [0,1] of order 3. This polytope has 4 corner points. 
Figure 2 illustrates the set of admissible sequences (ljZi,Z 2 ,Z 3 ) which satsify an additional linear 
equation, cf. the adjoint equation (1). 
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Figure 2: The set of Hausdorff sequences of order 3 with an additional equality constraint 

In general, for any finite M the “finite dimensional Hausdorff conditions of order M” involve the 
iterated differences of a sequence (^o> - 2 : 1 , . . . , zm) and are, in the 1-dimensional case, of the form: For 
every m, 0 < m < M, and k, 0 < k < M — m, 

i-irA"^z(k) := ± > 0; (5) 

i=o \ J ) 

see Section 3 for the 2-dimensional conditions. In the next two sections we shall illustrate how finitely 
many equality constraints on sequences (zq, ^i, . . . , zm) derived from the adjoint equation (1) together 
with the linear inequalities (5) provide a viable method to compute exit time distributions. To this 
end it is important to exploit the inversion formula for a distribution function in terms of iterated 
moment sequences, cf. [5] for the 1-dimensional case and [3] for the higher dimensional case. To 
illustrate the point, let us consider the 1-dimensional case. Given a sequence (zq.zi,. . zm) which 
satisfies (5) we place the mass 



( 6 ) 

at the point k/M, 0 < k < M, and use the resulting mass distribution as an approximation to a 
scaled version of the exit distribution. 



3. The LP-approach for geometric Brownian Motion. Let (Xt)t>o denote geometric Brownian 
motion starting at xq, i.e. (JVt) is the solution of the stochastic differential equation 



dXt = aXt dt 4- aXt dWu ^(0) = xq > 0. 



( 7 ) 



235 



We shall call a the drift coefficient and a the diffusion coefficient of this process. Geometric Brownian 
motion is the classical model for risky assets and is at the heart of the Black-Scholes formula, cf. eg. 
[1]. To compute higher moments of the exit time of (Xt)t from the interval (0, 1) we need to consider 
the generator A of the time-space process VJ = ^ > 0> i e- for smooth functions f{x,t), 

Af{x, t) = + Y 

Let T = Txq denote the first exit time of {Xt) from (0, 1) when the process starts at xo, 0 < xo < 1. 
We shall compare the numerical values of E[t% n > 0, obtained by using the LP-approach with 
the exact values derived from the following analytical formula for the Laplace transform of r (see 
Proposition 1 below); we have used Mathematica to do the necessary differentiation when computing 
moments, see Table 1 below. 

Proposition 1. Let (Xt)t>o be a geometric Brownian Motion with drift parameter a and diffusion 
coefficient cr, {a — <J^/2) > 0, starting at xq, 0 < xq < 1, and let r = inf{t > 0 | Xt = 1}. The 
Laplace transform of r is given by, A > 0, 



E[e = exp [in(xo) (yXA-{- B - J?)] , (9) 

where A = ^ and ~ • 

Proof. Since (Xt) is a solution of the stochastic differential equation 

dXt = aXtdt A cXtdWu X(0) = xq, 



we get 



Thus, for every /x > 0, 



Xt = xo exp 






t + aWA 



© 



exp 






= exp[-M(a-y)*-^< . 



Since E^At] < oo, = 1 and since [^exp fJxrWt — ^—^t ^ is a martingale we obtain, 

\ ^ \J t>0 



x^o=E[e-^^], 

where 

2 

A:=/iO:-y(/i-/a^). (10) 

We obtain formula (3) when we solve Eqn. (10) for // > 0 in terms of A. 



O 
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Instead of doing computations directly for Xt we shall analyze the transformed process Zt = 
(-ln(Xt),t) and employ a time and space truncation technique to reduce the exit time problem 
for (Zt) to one from a bounded domain. Since (Xt) solves Eqn.(l), ^t '= -ln(A't) equals drifted 
Brownian, i.e. 

d^t = -(a-j) dt + ad i-Wt) , ^(0) = - log (xo), 

and 



'^xo = inf{t|^t = 0}. 

Since (^t) lives on (0, oo) we approximate by 

= inf { t I (^t) exits from (0, 6)} , 

b “large” (depending on the coefficients a and a). 

Since 

= inf 1 1 I ^ exits from (0, 1)| 
we shall henceforth consider the process rjt where 

vio) = ^0- 

Since we need to consider the time-space process (r/t, t) we shall also truncate the time coordinate 
at some “large” value T. Typical values for b and T which we have used for the case a = 2 and 
= 1 are 6 = 8 and T = 10. The actual choice of b and T is dictated by the requirement the 
probability of not exiting from (0,6) x (0, T) to be “small”. Using monomials of the form n, 
m > 0, X G [0, 1], t G [0,T], as test functions and calling z[n, m] the moments of the occupation 
measure of (r]t,t) on (0,1) x (0,T), zo[m], Zi[m] and ZT[m] the moments of the “exit” distribution 
on {0} X [0,T], {1} X [0,T] and [0, 1] x {T}, resp. the adjoint equations become, /i = -1 (a - ^), 
to = 0, m, n > 0, 



^ r ^ 1 ~ r r 1 

0 = -m z[m - l,n\ 2 :[m,n- 2 ] - n fx z[m,n - 1] 



-i'o'Ho + zo[m] + Zi[m] + ZT[n], 



( 11 ) 



where values for negative indices are defined to be zero. 

To these linear equality constraints we add the linear inequality constraints due to the Hausdorff 
conditions for z, zq^ z\ and ztj viz. for all 0 < n, m < M, i < M — n, j < M — m 



EE (;) (7) (-ir 



T' 



^ (ikj 



> 0 , 
> 0 , 



( 12 ) 



(13) 
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and, V € {0, 1}, 

The (approximated) m-th moment of the exit time r is thus given by 



zi{m) + zo{m). (15) 

The following numbers for the moments of the exit time for geometric Brownian motion were obtained 
using the interior point solver of the Cplex package restricting M to 12 and using as an “artificial” 
objective function a scaled sum of all the variables. Similar numbers are obtained when using the dual 
or the primal solver of Cplex. Note, that the primal solver requires the largest number of iterations 
for this kind of problems. Since the moments of geometric Brownian motion vary over a large range 
depending on the choice of a, e.g. « 3.7 • 10^^ if a = 1, cr^ = 1, and « 3.9 • 10“^ for 

a = 10, = 1, it can be seen from Table 1 that the moments computed using the LP-approach 

compare well with the exact values for the moments (listed in columns headed by E(xact) V(alues) 
below). 





Xq - 


= 0.2 


Xq = 


= 0.5 


Xq = 


= 0.8 


Moment 


LP 


EV 


LP 


EV 


LP 


EV 


0 


1.00000 


1.000000 


1.000000 


1.000000 


1.000000 


1.000000 


1 


1.07296 


1.072959 


0.462098 


0.462098 


0.148762 


0.148762 


2 


1.62811 


1.628111 


0.418912 


0.418912 


0.088247 


0.088247 


3 


3.40601 


3.406048 


0.657223 


0.657223 


0.120955 


0.120955 


4 


9.44191 


9.443340 


1.549950 


1.549947 


0.270741 


0.270741 


5 


33.26220 


33.300460 


4.962390 


4.962398 


0.844981 


0.844982 


6 


143.25000 


144.073391 


20.180500 


20.180559 


3.385860 


3.385919 


7 


727.11000 


742.695628 


99.719300 


99.720156 


16.569500 


16.572083 


8 


4185.14000 


4456.993374 


580.459000 


580.470151 


95.732300 


95.824745 



Table 1: Moments of the exit time distribution of geometric Brownian Motion with 

drift Q! = 2, variance = 1 and various initial values. We have used 
the interior point algorithm provided by Cplex, using space truncation 
at 6 = 8 and time truncation at T = 10 (for xq = 0.2), T = 11 (for 
xq = 0.5) and T = 12 (for Xq = 0.8). 



4. The LP-approach for the Roberts-Shiryaev process. The Roberts-Shiryaev process {Xt)t>o 
is the solution of the stochastic differential equation 



dXt = (1 + 2Xt) dt + V2Xt dWt, A:(0) = xq = 0. 
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It is an operating characteristic of a detection scheme, cf. eg. [10], for the following basic continuous 
time model of a constant drift term embedded in Brownian noise. Let {^t)t be the process 



d^t = pl{t>e}dt H- adWt, 



where, in the simplest case, the parameters p and a are known constants. The unknown parameter 
6 specifies the beginning of the effect of the constant drift term p. Without loss of generality we 
may assume that ^ = L The Roberts-Shiryaev detection scheme is the procedure which sounds an 
alarm, indicating a change of the drift term, at the first time the statistic (Xt) hits a given threshold 
value 6, cf. [10]. In [6] we have used the LP method to numerically evaluate the first moment of 



T = mf{t\Xt>b}. 



Here we compute higher moments of r. Note that the R-S process is very much like geometric BM, 
so we have used the latter problem, for which exact formulas for the moments are known, to calibrate 
our method for the former problem. 

Besides the linear inequalities due to the Hausdorff conditions the adjoint equations become using 
time truncation at a chosen value T, 0 < n, m < M, 



0 = —mz[n,Tn — l] — n(n — 1) ^[n, m] — 2nz[n,m] — ri2;[n — l,m] 
- 0 "* • 0 ” + 6 ” zi[m] + ZT[n], 



(16) 



where z[n, m] denotes the moments of the occupation measure on (0, b) x (0, T) and Zi[m], ZT[n] resp. 
the moments of measures on the boundary {6} x [0, T] and [0, b] x {T} resp. 

Table 2 shows values for the first moments for various threshold values 6. The following formula for 
the first moment of r, cf. [10], 




can be easily evaluated. This formula together with other simple estimates on higher moments 
of r show that the LP approach provides accurate results for the case under consideration. Other 
detection schemes like the CUSUM and the EWMA procedure, cf. [10], can also be decribed in terms 
of one-dimensional diffusion processes. They also lead to exit time problems and can be analyzed in 
the same way as the Roberts-Shiryaev process. 
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Moment 


1 


2 


3 


4 


5 


0 


1.000000 


1.000000 


1.000000 


0.999998 


0.999982 


1 


0.597304 


0.933651 


1.191100 


1.568470 


1.602940 


2 


0.416417 


1.058040 


1.773620 


3.284260 


3.549480 


3 


0.342734 


1.466020 


3.316280 


8.926700 


11.609900 


4 


0.334072 


2.457870 


7.681630 


29.455300 


50.678200 


5 


0.384111 


4.909770 


21.592600 


110.254000 


247.368000 


6 


0.517888 


11.629800 


72.314300 


455.4710000 


1236.630000 


7 


0.850480 


32.633900 


284.562000 


2036.870000 


6144.720000 


8 


2.135710 


105.543000 


1290.580000 


9569.360000 


30286.500000 


9 


5.538060 


359.755000 


6352.490000 


46211.100000 


147204.000000 


10 


7.096700 


1134.580000 


29272.400000 


216009.000000 


683986.000000 



Table 2: Moments of the exit time from an interval (0,6), 6 = 1, ... ,5, for the 

Roberts-Shiryaev process with drift value a = 2, variance = 2 and 
initial value 0. We have used a time-truncation-level of 10 and employed 
the dual solver of the Cplex package. 



5. Implementation issues. The LP problems which arise as a result of our method are highly 
structured. We therefore recommend to use matrix generator languages like AMPL and GAMS to set 
up the linear programs. Employing such an interface greatly simplifies the testing of modifications of 
the basic problems. For instance, instead of only using (approximating) moment sequences as vari- 
ables we have also run programs using all iterated differences as unknowns. While this dramatically 
increases the number of variables our test runs seem to indicate that problems with “very large” M 
can be handled this way. 

Another issue which we addressed is the kind of solver to use. As far as packages are concerned 
we have tested the routines provided by CPLEX, NAG and SAS, all of which handled the problems 
described in sections 3 and 4 well. Due to the large number of constraints compared to the small 
number of variables when formulating the LP-programs using (approximating) moment sequences 
as variables the dual simplex algorithm requires fewer iterations than the primal one. To find a 
feasible point for the dual problem is easy while the first phase of the simplex algorithm for the 
primal problem typically requires several thousand iterations for medium size problems. 
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Recursive Direct Algorithms for Multistage Stochastic 
Programs in Financial Engineering 

Maxc C. Steinbach, ZIB, Berlin 

Summary: Multistage stochastic programs can be seen as discrete optimal control problems with a 
characteristic dynamic structure induced by the scenario tree. To exploit that structure, we propose 
a highly efficient dynamic programming recursion for the computationally intensive task of KKT 
systems solution within an interior point method. Test runs on a multistage portfolio selection 
problem demonstrate the performance of the algorithm. 

1. Introduction 

Multistage stochastic programs have become an important approach to model the process of decision 
making under uncertainty over a finite planning horizon. Important applications include, among 
others, financial engineering problems such as portfolio selection or asset and liability management. 
Multistage stochastic programs are considered a very hard class of optimization problems since their 
size can become excessively large even for coarse discretizations of the probability space and possibly 
the time horizon. Nevertheless, the characteristic structure of scenario trees makes these problems 
tractable by numerical algorithms. Among the most prominent ones are several variants of decompo- 
sition methods. Primal decomposition approaches [3, 6, 9] assign a small local optimization problem 
to every node and treat the vertical coupling between stages iteratively, by passing intermediate 
solutions up and objective and feasibility cuts down the scenario tree. Dual decomposition and 
progressive hedging algorithms [13, 15, 17] optimize individual scenarios and iterate on the nonantic- 
ipativity condition representing the horizontal coupling. Both groups of algorithms offer a significant 
degree of inherent parallelism [2, 4, 7, 18]. 

This paper presents a highly efficient method adopting the complementary strategy: as in [1, 5, 10], 
optimality and feasibility are achieved by an interior point iteration whereas the global coupling across 
the tree is treated explicitly. The distinguishing new component of our method is a recursive direct 
KKT solver that fully exploits the topological structure of the scenario tree. Rapid convergence is 
ensured by a simple but effective refinement technique that exploits the multistage stochastic nature 
of the problem. Due to space limitations, this technique will be presented in a forthcoming paper [21]; 
our emphasis here is on the inherent dynamic structure of multistage stochastic programs, which we 
view as discrete optimal control problems. Two formulations of dynamics are distinguished, explicit 
and implicit, and corresponding classes of tree-sparse quadratic programs (QP) are introduced. These 
purely equality-constrained QP represent simple, well-understood optimization problems in their 
own right, whose solution by direct application of dynamic programming naturally leads to the KKT 
recursions of interest. On the other hand, their structure is sufficiently general to catch the essentials 
of nonlinear stochastic optimization within interior point or sequential quadratic programming (SQP) 
methods. In fact, the KKT recursion generalizes a previous serial version that has been successfully 
applied in a direct SQP method for nonlinear trajectory optimization [22, 23, 24]. 

The remainder of the paper is organized as follows. After introducing a quadratic multistage portfolio 
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selection problem as an example, we formulate explicit and implicit tree-sparse quadratic programs 
as general problem classes, and present the recursive solution algorithms. Computational results on 
the example problem demonstrate the excellent performance of these algorithms. Finally we provide 
some conclusions and future directions of research. 

2. A multistage portfolio selection problem 

We consider a multistage extension of the mean- variance approach as introduced by Prauendorfer [8]. 
Assume that a portfolio manager can invest into n risky assets. After the initial investment at t = 0 
the portfolio may be restructured at discrete times t = 1, . . . ,T; it is redeemed one period later at 
time T+ 1. Denote by uj' the transaction volume in asset v at time t (positive for buying, negative for 
selling), by xj' the capital at time t after rebalancing, and by Vt^Xt € R” the vectors with components 
respectively. The initial wealth is normalized and fully invested, and subsequent transactions 
do not change the wealth. With e := (1, . . . , 1) these conditions read e*Xo = 1 and e*Vt = 0, t > 0. 
The objective is to minimize risk while achieving a prescribed expected return on investment. This 
yields the same efficient frontier as maximizing the expected return p for a given level of risk and can 
be seen as a multistage version of the classical Markowitz approach [11]. Other researchers formulate 
the portfolio selection problem as a stochastic network problem [12, 14] or model the tradeoff between 
profit and risk by a utility function [6]. 

A standard factor model is assumed for the random returns and associated risk: the risk driving 
factors are given by stochastic data ujt observed at time t = 0, . . . ,T. Denote by := (cjq, ■ • • 
the history of random events up to time t, by rt(cj*) G R” the vector of returns in period \t — l,t], 
by the expected return in period [T,T + 1] conditioned on o;^, and 

by ^ the associated covariance matrix. The decision at time t > 0 is made after 

observing cjf, yielding asset capitals xj' = r^{u^)x^_i + u{'. Clearly, this leads to a nonanticipative 
policy X = (xo, . . . ,xt), that is, decisions may depend on the past but not on future events. (Note 
that uo is observed before the initial decision and can thus be treated deterministically.) 

Risk is modeled as the variance of the expected return at T -h 1, which can be written as 

{rj-+i(w^+')*xr(w^)} = E„r {xr(w^)* [Et(w^) + fr(w^)fT(‘^^)*] - p^- (1) 

For numerical computations, the stochastic evolvement of risk driving factors is approximated by a 
discrete set of scenarios^ that is, sequences of events. Given a partial event history a;*, we distinguish 
only finitely many possible outcomes for the next observation This branching process creates 
a scenario tree rooted in the deterministic initial event ojq. We denote by Lt the level set of nodes 
representing event histories up to time t, by L = Lt the set of leaves, and by V := U^o 
complete vertex set. In the following we use j e V as node variable and denote by j = 0 the root, 
by 7r(j) the predecessor (parent) of node j and by S{j) its set of successors (children). The path 
to node j is a partial scenario with a probability pj; thus the probabilities sum up to one in each 
level set Lt, t = 0, ..., T, and pj is the product of probabilities on the path to j. The tree structure 
reflects the growing amount of information: two scenarios that are indistinguishable up to time t 
share the path (the same information up to some node j G Lt but may follow different branches 
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when the next event Ut+i is observed. 

Assume that returns f* and associated covariance matrices Ejb are given (j e V,k e L = Lt). 
Dropping the constant — in (1), the portfolio selection problem can then be written as the QP 

imn 53 x)\pj{T,j + fjf;)]xj 

jeL 

s.t. e*Xo = 1, 

e'Xj = rjx^(j) Wj eV\ {0}, 

'Z,P}rjXj=p. 
jeL 

Note that the transaction volumes do not appear here: transition equations (4) simply state that the 
net value of the portfolio remains unchanged when it is restructured. The same condition determines 
the first component vj uniquely if the “control vector” Uj := . . . , up* is given. Defining 

E := ("p € ho := Q 6 R", 

and Gj := Diag(r], . . . ,rp, hj := 0, j e V \ {0}, we can thus replace (3,4) by explicit transition 
equations xj = GjXjr(j) + Euj + hj (with x,r(o) € R°). The complete QP reads now 



min 




(6) 


’ 


jeL 




s.t. 


Xq = Euq + ho. 


(7) 




Xj = GjXjr(j) + Euj Vj € \ {0} , 


(8) 




Y^PjrjXj = p. 


(9) 




jeL 





For a given scenario tree, both the implicit and the explicit QP represent deterministic equivalents 
of the verbally formulated stochastic program. The explicit form clearly expresses their nature as 
optimal control problems. 

Realistic asset allocation problems typically include various inequality constraints imposed by the 
investor and/or the market, a cash account (with riskless, deterministic return), transaction costs [6], 
and often cash deposits and withdrawals. While the implicit form above is well suited and more 
efficient for our simple model problem, the explicit form can easily be extended to include such 
model refinements. Details will be given in a forthcoming paper. 

3. General problem classes 

In this section we introduce two general problem classes, the tree-sparse quadratic programs, based 
on explicit and implicit dynamic equations, respectively. The explicit case, as a natural extension 
of deterministic optimal control problems, is very similar to the problems introduced in [16]. The 
implicit form is often used in financial applications but has not yet received much attention from 
an optimal control viewpoint. A detailed study of both problem classes, the associated recursive 
algorithms, and the relation of explicit and implicit formulations will be given in [20]. 



( 2 ) 

(3) 

(4) 

(5) 
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3.1 Dynamic structure 

In the following we consider dynamic systems on arbitrary trees. For the sake of brevity, the parent 
node of j will now be denoted by z = 7r(jf). 

Explicit dynamics. Associated with each node j e V are a state vector Xj, a control vector Uj, 
and an (affine) transition mapping pj. Every state xj depends on its preceding state Xi = x-j^^j) and 
on control Uj through explicit dynamic equations 

~ Pjip^ii '^j) — GjXi + EjUj + hj, j 6 V. (10) 

To avoid a special notation for the root j = 0, we formally assume that x,r(o) is zero-dimensional and 

not associated with any node. Every control vector u = {uj}j^v uniquely determines a state vector 
X = {xj}. Thus the control variables represent all degrees of freedom in the dynamic system. 
Implicit dynamics. As in the portfolio problem, a more compact formulation of dynamics is often 
possible. Associated with each node is again a state Xj but no control, and only part of Xj depends 
on the preceding state via implicit dynamics 

Pjl^Xi, Xj) = GjXi + hj PjXj ~ 0) j ^ (11) 

Here the transition mapping pj maps into a space with smaller dimension than Xj, and Pj is required 

to have full rank. Thus (11) leaves some local degrees of freedom (“controls”) hidden in Xj. 

3.2 Tree-sparse quadratic programs 

We now formulate the quadratic programs which can be seen as discrete linear-quadratic optimal 
control problems: the aim is to find states and (possibly hidden) controls that minimize the objective 
while satisfying the dynamic equations and additional, problem-dependent linear constraints, the 
latter usually representing boundary conditions. The quadratic objective is typically a sum of local 
contributions, that is, it satisfies certain separability properties. This characterizes the tree-sparse 
case; only the linear objective term and additional constraints may introduce global coupling across 
the tree. 

Explicit tree-sparse QP. The tree-sparse quadratic program with explicit dynamics takes the 
general form 




EiDi Fj)h]+ey = 0. (14) 

jev V'j/ 



Note that the linear objective terms and constraints involve node variables Uj, Xj, whereas the mixed 
quadratic terms couple Xi, Uj along edge (z, j). These mixed terms may arise as second derivatives of 
nonlinear transition mappings in general optimization problems, hence the different contributions Hij. 
We define Hj := Hj -\-'Z,k£S{j) collect pure quadratic terms associated with Xj, and reformulate 




245 



the objective by substituting Hj for and zero for Hij. Potentially the boundary conditions (14) 
couple all decision variables globally across the tree. Note, however, that initial conditions may be 
modeled separately by suitable choice of the root transition mapping po- 

Regularity assumption. For QP (12-14) we require that (a) the constraints (13,14) have full rank, 
and (b) the objective is strictly convex on the null space of dynamic equations (13). Condition (b) is 
slightly stronger than usual: in addition to guaranteeing a unique minimizer, (a) and (b) also ensure 
that the QP can be solved by the recursive tree-sparse PH method developed below. 

Implicit tree-sparse QP. The tree-sparse quadratic program with implicit dynamics has the general 
form 





(15) 


jev ^ 




s.t. GjXi -f- hj = PjXj Vj e V, 


(16) 


H FjXj +ev = 0. 


(17) 


iev 





Regularity assumption. For QP (15-17) we require more restrictive conditions than before: (a) the 
constraints (16,17) have full rank, (b) the objective is strictly convex on the full space, i.e., every 
Hj is positive definite. These conditions ensure that the tree-sparse SC method can be applied to 
compute the unique minimizer. 



4. Recursive solution 

Under the respective regularity assumptions stated above, both the implicit and explicit tree-sparse 
QP have unique minimizers that can be obtained through recursive solution of the KKT conditions. 
Basically these recursive algorithms proceed as follows. In a leaf, all the local variables can be 
formally eliminated. This modifies only the KKT data associated with the parent node, leaving a 
similarly structured KKT system on the subtree without the leaf. Thus leaves are cut off in an 
inward recursion until the root is eliminated, and a positive definite system determines the global 
multiplier p. Node variables are then computed in an outward recursion. The inward recursion 
actually defines a direct symmetric factorization of the KKT matrix and transformation of the right 
hand side by one factor, while the outward recursion corresponds to the adjoint transformation. 
Details will be given in [20] along with proofs that only positive definite blocks are being inverted; 
here we restrict ourselves to an algorithmic description of the recursions. In both cases we introduce 
artificial zero blocks in the KKT conditions. These appear in positions that are modified during the 
recursion (fill-in), so they might contain nonzero entries without invalidating the algorithms. 



4.1 Recursion for the explicit QP: tree-sparse PH method 

The Lagrangian of the explicit tree-sparse QP reads 



L{u,x,X,n) = 






Ea; 

jev 






ijev V'jJ 



+ ev\ 
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Equating partial derivatives with respect to all variables Uj, Xj, Xj, fj, to zero and introducing Xy := 0 
yields the (linear indefinite) system of KKT conditions 



JjXi 4- HjUj — EjXj — DjjjL 4" dj — 0 


Vj € F, 


(18) 


HjXj-h E E Gix,-F;pL+fj = ^ 


Vi e V, 


(19) 


kes(j) kesu) 






GjXi 4- EjUj - Xj hj = 0 


Vi 6 V, 


(20) 


^^(DjUj 4- FjXj) 4- Xyp, 4- cv = 0. 




(21) 



j£V 



Consider a leaf j all of whose siblings are also leaves, S{i) C L, and observe that 5(j) = 0 in (19). 
Prom (20) and (19) we obtain immediately 

Xj — GjXi + EjUj + hjj Xj = —HjXj + — fj. (22) 

Substituting the expression for xj into (21) yields the local modification (in {z} U 5(i)) 

^ + ^j^j) + + ev = FiXi H- ^j'^3 ^v\5(») (23) 

jes{%) jes{i) 

where Fi := Fi 4- Ej FjGj, Dj := Dj + FjEj, and ey\ 5 (i) := ey + Ej Fjhy Similarly, substitution of 
Xj = -HjGjXi - HjEjUj 4- - fj, fj := fj 4- Hjhj into (18) and the parent’s equation (19) yields 

JjXi + KjUj — DjjjL 4- dj = 0, HiXi 4- K — Ff fi fi = 0, (24) 

jes{t) 

respectively, where Jj := Jj + EjHjGj, Kj := Kj-\-EjHjEj, dj := dj-¥Ejfj, Hi := Hi-\-J^jG*jHjGj, 
and fi := fi 4- Ej ^jfj- From the first equation of (24) one obtains now 

Uj = -kj\JjXi - D*jfi + dj). (25) 

Substituting that expression into the second equation of (24) and into (23) restricts the KKT system 
to the subtree with vertex set V\S{i), 

HiXi Xi - Ff fi -h fi = 0, FiXi 4* + ^v\s{t) = FiXi-\- Xy\s(i)fJ> + ey\s(i). 

Here the modified data are Hi := Hi — Ej Jjkf^Jj, Fi := Fi - Ej F)jkf^Jj, fi := fi — Ej Jjkf^dj, 
Xy\s{i) := Xy + Ej and ey\ 5 (i) := ey\s{i) - Ej ^jkf^dj. 

Repeating this process recursively creates the positive definite system X^fi = -em after eliminating 
the root node. Using the solution /i and formulae (25,22), the node variables Uj, Xj, Xj (in that order) 
are then calculated in the outward recursion. Xy — 0 may be replaced by any symmetric positive 
semidefinite block (of suitable dimensions) or, more generally, by any symmetric block yielding a 
positive definite X^. 

The elimination of Xj, Xj is a projection of the KKT system onto the null space of the local dynamic 
equation, while elimination of Uj is a local minimization using the projected Hessian kj. Thus 
each recursion step represents a local version of the projected Hessian (PH) method (or null space 
method), so we call the recursion a tree-sparse PH method. See [20, 22] for more details. 
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4.2 Recursion for the implicit QP: tree-sparse SC method 
The Lagrangian of the implicit tree-sparse QP is 

L{x, + fjxj] - X) ~ ■ 

jev ^ jev Ljev J 



After introducing Xy := 0 and Yj := 0, Zj := 0 , j € V, the KKT conditions read 

HiXj + P*Xi- Y, GlXH-F;n + fi = Q Vi€V, ( 26 ) 

*€sy) 

GjXi - PjXj + YjXj - Z]n + hj = Q Wj e V, (27) 

■” ZjXj) H- XvfJ> + Cv = 0 . (28) 

jev 

Again we consider a set of leaves S{i) £ L with common parent z, so that 5(j) = 0 in (26) and 

xj = -Hr\p;x^-F;,i-^fj). (29) 

Substituting Xj into (28) yields the local modification 

51 i^j^j ~ ^j^j) + + ev = - 51 ^3^3 ^V\S{i)tJ> + ^V\Si%) (30) 



i€S(») jesix) 

where Zj := Zj + Ylj FjHf^Pj, Xv\s{i) := Xy + Y^jFjHj^Fj ^ and ey\s{i) ey — Y,jFjH~^ fj. 
Substitution of Xj into (27) gives 

GjXi -}- YjXj — ^^1^ hj = 0 

with Yj := Yj + PjH~^Pjj hj := hj + PjH~^fj, and consequentially 

Xj = -Yr\GjXi-Z;f,-^hj). (31) 

From that we obtain final modifications of (30) and of the parent’s equation (26), 

FiOCi — ^3^3 ■*" ^v\5(t)/^ + ^y\S{i) — FiXi -H Xy\s{i)fJ> + ey\s{i)^ HiX{ P* Xi — F * /i 4- /j = 0, 

ies(t) 

respectively, where the modified data are := Fi -I- Ylj ZjYf^Gj^ Xy\s{i) •= Xy\s{i) — 
ey\s{i) '= ^v\s{i) - Ej ZjY~^hj, Hi := Hi + Ylj GjYf^Gj, and fi := fi + Ej GjYj~^hj. 

The inward recursion again yields a system of the form X^^i = -C 0 defined on the empty tree, and 
node variables Xj^Xj are calculated (in that order) from equations (31,29). Xy,Yj may be replaced 
by arbitrary symmetric positive semidefinite blocks (more generally, any symmetric blocks leading 
to positive definite X 0 , Yj), and Zj by any rectangular blocks of suitable dimensions. 

The elimination steps (31) for Xj involve Schur complements Yj, and each recursion step represents a 
local version of the Schur complement method (or range space method). Hence we call that recursion 
a tree-sparse SC method. Again, see [20, 22] for details. 
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QP form 


stages 


scenarios 


nodes 


variables 


constraints 


matrix entries 


time 


explicit 


2 


81 


91 


1,365 


729 




0.01 


explicit 


3 


729 


820 


12,300 


6,561 




0.22 


explicit 


4 


6,561 


7,381 


110,715 


59,049 




2.11 


explicit 


6* 


48,000 


59,977 


899,655 


479,817 


25,430,010 


18.4 


implicit 


3 


729 


820 


6,560 


821 




0.06 


implicit 


4 


6,561 


7,381 


59,048 


7,382 




0.80 


implicit 


5 


59,049 


66,430 


531,440 


66,431 




7.21 


implicit 


6* 


217,728 


259,939 


2,079,512 


259,940 


23,394,504 


33.1 



Table 1: Sizes of KKT systems and solution times for tree-sparse recursions on balanced trees with 
9 branches per node. *Tree not balanced; largest problem fitting into memory (KKT data « 207 MB). 

5. Computational results 

As test problems we consider explicit and implicit formulations of the portfolio management problem 
with n = 8 assets. The scenario trees have nine branches per node, giving — l)/8 nodes and 9^ 
scenarios for the T-stage problem. In addition, two 6-stage problems that fill all available memory 
are constructed (with different numbers of branches). Benchmark runs are performed on a Silicon 
Graphics 02 workstation with a 175 MHz RIOOOO processor. The algorithms are implemented in C++ 
and treat all matrix blocks as dense and scenario-dependent. Table 1 lists the sizes of KKT systems 
and the CPU times in seconds. These data clearly demonstrate the efficiency of the recursions. 

6. Conclusions and extensions 

We have proposed a new method for multistage stochastic programs, based on a general strategy that 
is often pursued in nonlinear programming and clearly formulated in [22]: we treat nonlinearities 
and inequality constraints globally by suitable iterative algorithms (interior point or SQP methods), 
and exploit any problem-inherent structure on the linear algebra level. Clearly, that strategy has 
proved successful here due to the excellent performance of tree-sparse recursion in the critical task of 
KKT systems solution. In addition, convergence of the global iteration on inequalities is significantly 
enhanced by exploiting nonlinear characteristics of the problem through successive refinement [21]. 
Our method is not only conceptually elegant, it is also advantageous from a software engineering 
viewpoint. Since any relevant problem structure is concentrated in the KKT systems, one can apply 
generic iterative algorithms globally and still obtain high efficiency. For instance, linear algebra 
operations on subtrees are completely independent in the tree-sparse recursions, so they can easily 
be parallelized without affecting the interior point method. 

In this work we have only considered the basic recursive structure of dynamics. In applications, 
local constraints involving only one control Uj, state Xj, or pair (xi,Uj) may appear in many nodes. 
Such constraints can be eliminated by local projections; the details given in [22] (for the chain case) 
generalize immediately to the tree case, except for minor technical complications when the number 
of local constraints exceeds the number of local degrees of freedom. 
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With local constraints included, our view on multistage stochastic programs as optimal control 
problems with a recursive dynamic structure provides a general framework for classifying variables 
and constraints, and for treating them appropriately. It also provides a guideline to exploit not only 
the common dynamic structure of multistage stochastic programs but also the local sparse structure 
of every specific application. In the portfolio selection problem, for instance, matrices Gj,Ej are 
(almost) diagonal in the explicit QP, and matrices Ej are identical in all nodes for both QP forms. 
This can be used to save CPU time and memory simply by specializing the node operations. The 
more difficult situation of sparse blocks in the presence of local constraints can be dealt with efficiently 
in a number of financial engineering problems. In the general case, suitable sparse local projections 
have to be constructed, which requires highly sophisticated sparse matrix techniques and remains a 
promising subject of future research. 

We believe that the method proposed here is a significant step towards meeting at least one of the 
challenges formulated by Ruszczynski [19]: the development of specialized methods and high quality 
software for certain application areas. 
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MEPO, ein System zur efFektiven parallelen Bearbeitung 
von Optimierungsaufgaben 

Dipl.-Ing. Mathias Hadenfeld, TU Braunschweig 



Summary; 

An der TU Braunschweig wurde in den letzten Jahren das Softwarepaket MEPO zur effektiven 
Bearbeitung langrechnender numerischer Optimierungsaufgaben entwickelt. Austauschbare Mo- 
dule sind mit einer Kommunikationsstruktur auf Message-Passing-Basis verbunden. Uber flexible 
Schnittstellen lassen sich vom Benutzer neue Optimierungsroutinen und Giitefunktionen in Module 
einbetten und in das System integrieren. 

Genutzt werden von MEPO vernetzte Workstations und Rechner mit BatchzugrifF. Eine Paral- 
lelisierung ist sowohl auf der Ebene der Optimierer, wie auch der Giitefunktion moglich. Eine 
integrierte adaptive Lastverwaltung verteilt die Module und Tasks auf die eingebundenen Rech- 
ner und sorgt fiir kurze Iterations- Zeiten. Ausfalle von Modulen oder Rechnern werden von einem 
Checkpoint /Restart-System ohne Riickwirkung auf die Optimierungen abgefangen. 

Eingesetzt wird das System zur Zeit fiir Optimierungsaufgaben in den Ingenieurswissenschaften. 
Anwendungsbeispiele sind Tragfliigeloptimierungen bei der Daimler Benz Aerospace Airbus und dem 
DLR, die Entwicklung von Kernkraftwerks-Beladeplanen bei den Firmen Siemens und Scandpower. 



1. Einleitung 

Kiirzere Entwicklungszeiten sowie gestiegene Anforderungen an Qualitat und Leistung erfordern bei 
der Entstehung neuer Industrieprodukte in immer grofierem Umfang den Einsatz moderner Design- 
met hoden. Immer komplexere Planungs- und Produktionsablaufe sind nur noch mittels Software- 
unterstiitzung effizient zu gestalten. Die numerische Optimierung erlangt im Rahmen dieser Ent- 
wicklungen in vielen Bereichen der Ingenieurs- und Wirtschaftswissenschaften eine immer grofiere 
Bedeutung. 

Mit dem Begriff ’numerische Optimierung’ werden in den verschiedenen Fachdisziplinen eine Vielzahl 
unterschiedlicher Methoden und Verfahren bezeichnet. Der grofien Klasse der direkten Verfahren ist 
eine generelle Vorgehensweise gemeinsam. Eine Optimierungsroutine erzeugt bzw. variiert Ein- 
gangsdaten fiir eine Simulation oder Kalkulation des zu optimierenden Systems. Beispiele fiir solche 
Routinen sind Evolutionare Algorithmen, Gradientenstrategien oder Simulated-Annealing. Mittels 
eines Simulationscodes wird eine Qualitat ermittelt, die als Ergebnis an den Optimierer zuriickfliefit. 
Iterativ werden die Eingangsdaten bis zum Erreichen des Optimums verbessert. Die fiir die Umset- 
zung der Eingangsdaten in einen Qualitatswert verwendete Software wird dabei als Qualitats-, Ziel- 
oder Giitefunktion bezeichnet. 

In vielen Veroffentlichungen, z.B. [AT95], [DRA97], [DGL98], wird auf die Aufarbeitung des zu 
optimierenden Systems und die Auswahl und Anwendung einer geeigneten Optimierungsstrategie 
eingegangen. Wenig Aufmerksamkeit kommt dabei den vielen Vorbereitungen zu, die zu einem ar- 
beitenden Optimierungskreislauf fiihren. Es miissen Strategie und System gekoppelt, geeignete Ein-, 
Ausgabe- und Steuerungsmoglichkeiten integriert und die zu variierenden Eingangsgrofien ausgewahlt 
werden. In Forschungsprojekten verbrauchen diese Vorbereitungsarbeiten einen nicht unerheblichen 
Teil vorhandener Kapazitaten. Im industriellen Alltag kann die Erstellung von geeigneten Benutzer- 
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schnittstellen und die Anpassung der Software an neue Erfordernisse zur Hauptaufgabe werden. 

An der TU Braunschweig wurden verschiedene Projekte zur Optimierung durchgefiihrt [Ax94], 
[PAD97], [DRA97], [AHF96]. Um die Mitarbeiter von Standardarbeiten zu entlasten und weitere 
leistungsfahige Komponenten wie Parallelisierung zu integrieren, wurde wahrend der letzten sechs 
Jahre das Softwaresystem MEPO (Multipurpose Environment for Parallel Optimization) entwik- 
kelt. Es handelt sich um ein modular aufgebautes Softwaresystem, in das iiber flexible Schnittstellen 
beliebige Optimierungsroutinen und Giitefunktionen eingebunden werden konnen. Die umfangrei- 
chen Ein- und Ausgabe- sowie Debug-Optionen erlauben die Anwendung des Programmsystems 
als Entwicklungsplattform in der Forschung. Eine bedienungsfreundliche Benutzerschnittstelle und 
Fehlererkennungs- und Korrekturmechanismen machen es alltagstauglich fiir den Einsatz in der In- 
dustrie. Zielrichtung der Entwicklung waren langrechnende Parameter- und Anordnungsaufgaben 
in den Ingenieurswissenschaften. Das System eignet sich aber zur Bearbeitung beliebiger Opti- 
mierungsaufgaben mit direkten Verfahren. Die Hauptarbeit bei der MEPO-Entwicklung wurde am 
Institut fiir Flugmechanik und Raumfahrttechnik IFR der TU Braunschweig durchgefiihrt, flnanziert 
aus den Projekten RaWid und MEGAFLOW des nationalen Luftfahrtforschungsprogramms. 



2. Struktur von MEPO 



Das MEPO-System besteht aus vier unterschiedlichen Modulen, die verschiedene Funktionalitaten 
enthalten. Die Module sind parallel laufende Programme, die iiber ein Message-Passing-System 
Daten austauschen. Als Kommunikationssoftware wird PVM (Parallel Virtual Mashine) in der Ver- 
sion 3.3 genutzt [GBD94]. Der generelle Aufbau ist in Bild 1 dargestellt. Das System ist lauffahig 
auf alien gangigen UNIX-Systemen (HP-UX, IRIX, AIX, LINUX, SUN-OS etc.). 



c 



User Interface (Ul) 






( Optimization^ J J 
VModul^OM)>^ 



3 



^ Parallel Optirnization 



Interface (POI) 



Modelling ^ 
l^ule (MM) 

objective 
functi on jp 
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Module (AM) 

analyse 
i. function 



Bild 1: Struktur des MEPO-Systems 



CZD 



CZD 



MEPO-Modul 

Integrierter Benutzer-Code 
PVM-Kommunikation 



Eine Bedienungsoberflache (User Interface UI) und ein Steuerungsmodul mit der Bezeichnung Parallel 
Optimization Interface (POI) bilden die Basis von MEPO. Hinzu kommen ein Optimierungsmodul 
(Optimization Module OM) mit einer eingebetteten Optimierungsroutine sowie ein oder mehrere 
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Module mit der eingebetteten Giitefunktion. Notwendig ist ein Modul mit den Ein- und Aus- 
trittspunkten der Giitefunktion (Modelling Module MM). Optional konnen Teilaufgaben in ver- 
schiedene andere Module (Analysis Module AM) verlagert werden. 

Uber das UI kann der Benutzer das gesamte System verwalten und die Optimierung steuern und 
kontrollieren. Es beinhaltet einen bedienungsfreundlichen und fehlertoleranten Befehlsinterpreter 
mit einer umfangreiche Auswahl von Befehlsworten und Optionen. Ein- und Ausgabe kdnnen online 
Oder Datei-orientiert bei freier Formatwahl erfolgen. Vom UI wird das Steuerungsmodul POI ge- 
startet. Es initialisiert und iiberwacht die fiir die eigentliche Optimierung benotigten Module OM, 
MM und AM und koordiniert die gesamte Dateniibertragung. Weitere Aufgaben des POI sind die 
Rechnerverwaltung und die Lastverteilung bei Nutzung mehrerer Rechner. 

In die Module OM, MM und AM werden die Optimierungsroutine und die Giitefunktion integriert. 
Der Benutzercode wird mit MEPO-Bibliotheken zur Kommunikation, Datenspeicherung und Ver- 
waltung zu einem eigenstandigen, neuen Modul verbunden. Zur Verfiigung stehen Schnittstellen fiir 
FORTRAN- und C-Code. Die erzeugten Module werden gespeichert und wahrend der Systemlaufzeit 
in das System integriert. Uber das UI konnen die Komponenten fiir die Optimierung ausgewahlt und 
gewechselt werden. Aus den Modulen heraus lassen sich externe Programme aufrufen, z.B. kaufliche 
Oder unveranderliche Simulations- und Analysecodes. 

Die Erzeugung benutzerspezifischer Module ermoglicht die Flexibilitat von MEPO und die Eignung 
fiir beliebige Arten von Optimierungsaufgaben. Die Funktionalitaten in den Systemmodulen ergeben 
die hohe Leistungsfahigkeit und Sicherheit des Systems 

3. Abbildung der Grofien auf Parameter 

Giitefunktionen zur Bestimmung von Qualitaten fiir eine Optimierung haben im allgemeinen eine 
Vielzahl verschiedener Ein- und Ausgangsgrofien. Die Eingangsgrofien konnen in feste und von der 
Optimierungsroutine zu variierende Grofien aufgeteilt werden. Eine einzelne der Ausgangsgrofien 
wird als zu optimierende Qualitat ausgewahlt. Alle weiteren Grofien sind iiblicherweise fiir die 
Optimierung nicht von Bedeutung. 

Die Hauptaufgabe bei der Integration einer Giitefunktion ist die Zufuhr der Eingangsgrofien und die 
Weiterleitung der relevanten Ausgangsgrofie. In MEPO werden dazu samtliche Grofien unabhangig 
von ihrer Funktion auf ’Parameter’ abgebildet. Die Parameter haben eine ahnliche Funktionalitat 
wie Variablen einer Programmiersprache. Sie werden iiber einen eindeutigen Namen identifiziert 
und lassen sich mittels Indizes zu Vektoren und Arrays erweitert. Untereinheiten der Parameter 
sind ’Elemente’, die als eigentliche Datentrager fungieren. Sie enthalten einen Wert vom Typ Inte- 
ger, Real Oder String sowie Zusatzdaten fiir eine Optimierung wie Schrittweiten oder Grenzwerte. 
Parameter und Elemente werden im UI definiert und mit Werten belegt. Die Giitefunktion greift 
unter Verwendung der Namen und Indizes iiber Schnittstellenfunktionen auf die Elemente zu und 
kann die verschiedenen Daten ein- und auslesen. Entsprechend ihrer Funktion in der Optimierung 
wird den Elementen eine Bedeutung zugewiesen. Sie bestimmt die Verwaltung und die Zugriffsrechte 
in den einzelnen Modulen. Festen Eingangsgrofien bekommen die Bedeutung ’fix’, zu variierende 
Eingangsgrofien die Bedeutung ’design’. Die zu optimierende Ausgangsgrofie erhalt die Bedeutung 
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’objective’, alle weiteren Ausgangsgrofien werden auf ’free’ gesetzt. Die Urspriinge dieser Aufteilung 
finden sich in [Kr92]. 

Uber das UI lassen sich die Bedeutungen der Grofien variieren. Sollen in einer Optimierung einzelne 
Eingangsgrofien nicht mehr variiert werden, so werden sie von ’design’ auf ’fix’ gesetzt. Durch das Um- 
setzen von ’objective’ wird eine andere Ausgangsgrofie des Systems optimiert. Auf diese Weise kann 
man durch einfache Eingaben im UI eine einzelne Giitefunktion fiir eine Vielzahl unterschiedlicher 
Optimierungen verwenden, ohne Anderungen an der Integration der Giitefunktion durchzufiihren. 
Die Integration der Optimierungsroutine erfolgt entsprechend. Auch dabei werden die Ein- und 
Ausgangsgrofien auf Parameter abgebildet. Zusatzlich stehen Filterfunktionen zur Verfiigung, die 
die fiir die Optimierung relevanten Daten aus den Parametern extrahieren. Die Dateniibergabe an 
die Routine erfolgt dann mittels Vektoren und Arrays. 

4. Datentransfer und Aufteilung der Giitefunktion 

Fiir den Datentransfer zwischen den Modulen werden samtliche Parameter zu einem Datensatz 
zusammengefafit, der stets komplett weitergegeben wird. Beim Ubertragen an ein Modul werden 
Eingangsgrofien transportiert. Die Giitefunktion liest sie aus und bestimmt die Qualitat, die in den 
gleichen Datensatz zuriickgeschrieben wird. Dieser wird an das Ausgangsmodul zuriicktransportiert, 
wo das Ergebnis ausgelesen und verarbeitet werden. Eine Vervielfaltigung eines Datensatzes ist eben- 
falls moglich. Die sich ergebenden Datensatze werden an weitere Module zur Bearbeitung ubertragen. 
Der erste Datensatz mit der Startkonfiguration entsteht im UI durch den Benutzer. Er wird zum 
OM transportiert. Von der Optimierungsroutine werden dort die ’design ’-Elemente des Datensatzes 
variiert. Verschiedene Optimierungsstrategien, wie etwa die Evolutionaren Algorithmen, erzeugen 
mehrere Variationen gleichzeitig. Der Datensatz wird dann vervielfaltigt und jede Variation in 
einen eigenen Datensatz eingetragen. Sie werden einzeln zu den MM transportiert und verarbeitet. 
Mit den gleichen Datensatzen werden anschliefiend die Ausgangsgrofien der Giitefunktion zur Opti- 
mierungsroutine zuriicktransportiert . 

Der Datensatz kann universell zum Datentransport genutzt werden, er ist nicht auf Parameter der 
Giitefunktion beschrankt. So lassen sich mittels des Datensatzes etwa Einstellgrofien des Optimie- 
rers oder Pfadangaben fiir Aufrufe externer Programme ubertragen. Gleichfalls ist der Transport 
von Zusatzdaten aus der Giitefunktion moglich, wie z.B. Gradienteninformationen fiir die Opti- 
mierungsroutine. 

Lafit sich eine Giitefunktion in mehrere unabhangige Teilaufgaben aufspalten, so konnen diese in 
verschiedene einzelne Module, die Analysemodule AM, integriert werden. Im MM wird der zuge- 
sandte Datensatz modifiziert und vervielfaltigt. Die entstandenen Satze werden an die AM verteilt, 
die ihren jeweiligen Teil der Auswertung iibernehmen. Die Ergebnisse dieser Berechnungen werden 
im MM zur Ermittlung der Qualitat verwendet. Es konnen dabei verschiedene AM gleichzeitig oder 
nacheinander genutzt werden 

Diese Aufteilen der Giitefunktion bietet zwei Vorteile. Zum einen konnen im Rahmen einer Par- 
allelisierung Teilaufgaben gleichzeitig bearbeitet werden. Die Bezeichnung ’mittelgranulare Paral- 
lelisierung’ fiir die Aufteilung stammt aus diesem Zusammenhang, sie wird im folgenden Abschnitt 
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eingehender erlautert. Der zweite Vorteil ist der variable Einsatz fertiger Module. Bei der Op- 
timierung eines Automotors kann etwa die Berechnung der Lichtmaschine in ein AM ausgelagert 
werden. Dieses AM lafit sich allein als Giitefunktion fiir eine Detailoptimierung der Lichtmaschine 
nutzen. In grofierem Zusammenhang kann die Motorberechnung als Teilaufgabe einer Bewertung 
eines Gesamtautomobils verwendet werden. In beiden Fallen sind keine neuen Integrationsarbeiten 
durchzufiihren. Die Module lassen sich wie Teile eines Baukastens miteinander kombinieren. 

5 . Parallelisierung 

Viele Simulationen und Berechnungen stofien an die Kapazitatsgrenzen derzeit verfiigbarer Rechenan- 
lagen bzw. an die Grenzen akzeptabler Rechenzeiten. In einer Optimierung wird eine Simulation 
mehrfach aufgerufen, analog steigt auch der Rechenbedarf bzw. verlangert sich die Rechenzeit. In 
MEPO wird eine Parallelisierung zur Einbeziehung zusatzlicher Rechenleistung genutzt. Notwendig 
ist dabei die Aufteilung der Arbeit in weitgehend unabhangige Teilaufgaben, die Tasks. Als Task in 
MEPO ist die Bearbeitung eines Datensatzes in einem Modul zu verstehen. Die Anzahl und Art der 
eingebundenen Rechner ist beliebig. Vom POI werden auf den verschiedenen Rechnern Kopien der 
MM und AM angestartet. Der Datentransfer wird mittels PVM durchgefiihrt. 

MEPO bietet zwei Moglichkeit der Parallelisierung: auf der Ebene der Optimierung sowie auf der 
Ebene der Giitefunktion. Viele Optimierungsroutinen erzeugen mehrere unabhangige Variationen 
bzw. Datensatze gleichzeitig. Evolutionare Algorithmen bilden in jeder Iteration eine vorgegebene 
Anzahl ’Nachkommen’. Ahnliches Verhalten zeigt das Simulated- Annealing. Bei anderen Algorith- 
men kann jeweils in bestimmten Iterationen parallelisiert werden. Die Simplex-Methode nach Nelder 
und Mead erlaubt dies bei der Initialisierung und bei Kontraktion des Simplex-Korpers. Eine auf 
dem Differenzenverfahren basierende Gradientenstrategie lafit dies bei der numerischen Bestimmung 
des Gradienten zu. Die zweite Moglichkeit der Parallelisierung besteht innerhalb der Giitefunktion. 
Sind zur Ermittlung der Qualitat mehrere unabhangige Teilaufgaben zu bearbeiten, so konnen diese 
auf mehrere AM aufgeteilt und gleichzeitig bearbeitet werden. Eine Parallelverarbeitung ist damit 
auch bei sequentiell arbeitenden Optimierungsroutinen moglich. 

Beide Moglichkeiten sollen anhand einer aktuellen MEPO-Anwendung erlautert werden: der aerody- 
namischen Optimierung von Verkehrsflugzeug-Tragfiiigeln, die in Zusammenarbeit mit der Daimler 
Benz Aerospace Airbus durchgefiihrt wird. Aufgrund des Rechenzeitbedarfs ist ein hoher Grad an 
Parallelisierung angestrebt. Als Optimierungsstrategien werden Simplex- und Gradientenverfahren 
sowie Evolutionsstrategien verwendet. In der Giitefunktion wird eine 3D-Stromung durch 5 bis 10 
2D-Umstr6mungen an Fliigelschnitten approximiert. Die Ermittlungen der 2D-Stromungen bilden 
unabhangige Teilaufgaben. Weiterhin werden in einem sogenannten Multipoint-Entwurf bis zu 8 
verschiedene Flugzustande betrachtet, wie Start und Landung und verschiedene Reiseflugkonfigura- 
tionen. Auch diese Berechnungen sind voneinander unabhangig. Insgesamt sind bis zu 80 einzelne 
Tasks zur Parallelisierung verfiigbar. 

Beide Ebenen der Parallelisierung sind in Kombination nutzbar. Die Anzahlen der Tasks ergibt sich 
multiplikativ. Erzeugt eine Optimierungsroutine 5 unabhangige Fliigelvariationen, so ergeben sich 
insgesamt 5 MM- und bis zu 400 AM-Tasks, die verteilt werden kdnnen. 
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Der Grad der wechselseitigen Unabhangigkeit in einer Parallelisierung wird als ’Granularitat’ be- 
zeichnet. Eine grobgranulare Parallelisierung verlangt nur wenig Datenaustausch zwischen den 
Tasks. In MEPO ist eine solche Struktur auf der Ebene der Optimierungsstrategien zu finden. 
Eine feingranularen Parallelisierung wird durch eine umfangreiche Kommunikation zwischen Tasks 
charakterisiert. Sie wird in MEPO nicht genutzt. Anwendung findet sie etwa in vielen netzbasierten 
Simulations-Codes wie Stromungslosern oder FEM-Programmen. Die Granularitat bei der Aufteilung 
der Giitefunktion liegt zwischen der grobgranularen Parallelisierung auf Optimiererebene und der 
feingranularen Parallelisierung auf der Ebene von Simulationsprogrammen. Sie wird als ’mittelgra- 
nulare Parallelisierung’ bezeichnet. 

6. Adaptive Last vert eilung 

Die in MEPO verwendete grob- und mittelgranulare Parallelisierung stellt nur geringe Anforderun- 
gen an die Kommunikationskapazitaten. Genutzt werden von MEPO daher vernetzte Workstations. 
Diese sind meist als Arbeitsplatzrechner im Einsatz, was immer wieder zu kurzfristigen Spitzenbelas- 
t ungen fiihrt, die Rechner aber durchschnittlich nur gering auslastet [Pe97]. Die Rest-Kapazitaten 
sind ohne weitere Kosten verfiigbar und werden von MEPO fiir parallele Optimierungen genutzt. 
Sehr grofie Rechenleist ungen werden nach wie vor nicht von Arbeitsplatzrechnern, sondern von zentral 
administrierten Main-frames vorgehalten. Auf deren Kapazitaten kann in der Regel nur indirekt iiber 
Batch-Systeme zugegriffen werden. In MEPO konnen aus dem POI heraus Batch-Jobs generiert 
werden. Kommen sie zur Ausfiihrung, so wird ein Modul gestartet. Es nimmt iiber PVM Kontakt 
zum POI auf und wird wie andere Module eingebunden und verwendet. Ein gemischter Betrieb von 
Online- und Batchrechnern ist moglich. Beim Abbruch eines Jobs wegen Laufzeitiiberschreitungen 
konnen bis zum Ende der Optimierung weitere Batch- Jobs generiert werden. 

Workstation-Pools sind in der Regel heterogen. Sie enthalten unterschiedlich leistungsstarke Rech- 
ner, teils von verschiedenen Herstellern. Durch eine wechselnde Anzahl weiterer Nutzer ergeben sich 
stark schwankende Belastungen und damit Bearbeitungszeiten auf den einzelnen Rechnern. Auch 
beim Einsatz von Batchrechnern ergeben sich grofie Geschwindigkeitsunterschiede. Eine einfache 
Verteilung der verfiigbaren Tasks auf die vorhandenen Rechner wiirde zu einer sehr ineffektiven 
Ausnutzung der Kapazitaten fiihren. Einzelne Tasks auf leistungsschwachen oder belasteten Rech- 
nern konnen die Bearbeitung derart verzogern, dafi es gegeniiber einer seriellen Bearbeitung zu 
Verschlechterungen kommt. 

In MEPO werden mittels einer adaptiven Lastverteilung die Tasks so verteilt, dafi sich moglichst 
kurze Iterations- Zeiten ergeben. Basis fiir die Verteilung bilden die Bearbeitungszeiten der Tasks aus 
vorherigen Iterationen. Pro Rechner ist jeweils nur ein Task aktiv, um Verzogerungen durch Swappen 
zu unterbinden und die Laufzeiten moglichst ungestort zu ermitteln. Einer systemeigenen Routine 
wurde dabei der Vorzug gegeniiber allgemeinen Lastverteilungssystemen gegeben [LS94], um die a 
priori vorhandenen Zeitabschatzungen effektiv einsetzen zu konnen. Integriert ist die Lastverteilung 
zentral im POI-Modul. 

Die Verteilung von Tasks unterschiedlicher Bearbeitungszeiten auf mehrere unterschiedliche Rech- 
ner ist an sich bereits ein NP-vollstandiges Verteilungsproblem, das eine eigenstandige Optimierung 
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benotigt. In MEPO kann sich dariiber hinaus die Anzahl der Tasks durch die mittelgranulare Par- 
allelisierung andern. Weiterhin soil moglichst rasch auf Lastanderungen reagiert werden. Es wurde 
daher ein Verfahren entwickelt, das auf die optimale Losung verzichtet, mittels Heuristiken aber sehr 
schnell eine effektive Task-Verteilung erzeugt und diese kontinuierlich an die Lasten anpafit. 

Das Lastverteilung erzeugt eine Abfolge von temporaren Verteilungen. Die Tasks werden dabei 
nacheinander den Rechnern zugewiesen. Ziel beim Verteilen jedes weiteren Tasks ist, eine moglichst 
kurze Iterations-Zeit zu erreichen. Der Task wird dem Rechner zugeschlagen, der bei zusatzlicher 
Bearbeitung des Tasks die Gesamtzeit fiir die Iteration am geringsten ausweitet. Beriicksichtigt 
werden dabei bereits verteilte, unbearbeitete Tasks, Restlaufzeiten von aktiven Tasks sowie Kom- 
munikationszeiten. Auch der Ursprung der Tasks mufi beachtet werden, um keine ’Deadlocks’ zu 
erzeugen. Tasks der mittelgranularen Parallelisierung werden dabei mit hoherer Prioritat verteilt. 
Um eine gute Auslastung zu erreichen werden die Tasks weiterhin der Grofie nach sortiert. Durch das 
Zuriickhalten von kurzlaufenden Tasks lassen sich die Leerlaufphasen von Rechnern am effektivsten 
reduzieren. 

Die Tasks werden einzeln entsprechend der Verteilung angestartet, bis bestimmte Ereignisse eine Neu- 
verteilung der noch unbearbeiteten Tasks erforderlich machen. Solche Ereignisse sind das Leerlaufen 
eines Rechners oder die Erweiterung der Task-Menge durch die mittelgranulare Parallelisierung. 
Durch die wiederholte Neu verteilung der Tasks wird flexibel auf deren Zu- und Abnahme reagiert. Die 
Verwendung immer der aktuellsten Laufzeiten erlaubt die sehr schnelle Reaktion auf Lastschwankun- 
gen der Rechner. Ein weiterer Vorteil ist, dafi sich wahrend der Optimierung neue Rechner in das 
System integrieren bzw. daraus herausnehmen lassen. Dies kann Benutzer-gesteuert erfolgen; eine 
weitere Moglichkeit ist die zeitgesteuerte Nutzung von Rechnern, etwa nur in den Nachtstunden. 

Die mit der Verteilung zu erreichende Effizienz soil anhand der oben erwahnten Tragfliigeloptimierung 
mit dem MEPO-System in Bild 2 gezeigt werden. Optimiert wurde dabei ein Einzelprofil durch eine 
Evolutionsstrategie. In jeder Iteration wurden 9 Profilvarianten erzeugt, zu deren Qualitatsbestim- 
mung ein in ein AM ausgelagerter Stromungsloser genutzt wurde. Es ergaben sich 9 MM- und 
9 AM-Tasks mit Laufzeiten zwischen 4 bis 7 bzw. 140 bis 180 Sekunden. Verwendet wurden 3 
HP-J180 Workstations im Tagesbetrieb. Es wurden 115 Iterationen in ca. 15 Stunden gerechnet. 
Dargestellt sind die akkumulierten Zeiten auf den 3 Rechnern iiber den Iterationen. Der Overhead 
durch Kommunikation, Verwaltung und Wartezeiten betragt dabei unter 5% der Gesamtlaufzeit der 
Rechner. 

7. Checkpoint/Restart-System 

Beim Einsatz von vernetzten Workstations ergibt sich das Problem, dafi Prozesse nicht so stabil 
laufen wie auf Grofirechnern oder echten Parallelrechnern. Gerade bei Datenzugriffen iiber belastete 
Netzwerke kommt es immer wieder zu Ausfallen [Pe97]. Bei Laufzeiten von einigen Stunden bis 
Tagen auf mehreren Rechnern ergeben sich signifikante Ausfallwahrscheinlichkeiten. Daher wurde in 
MEPO ein Checkpoint /Restart-System integriert. 

Samtliche Daten eines Tasks befinden sich in einem Datensatz, der an das POI gesendet und dort zwi- 
schengespeichert wird. Wahrend der Bearbeitung eines Tasks bleibt eine Kopie erhalten. Bei Ausfall 
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Bild 2: Effizienz der adaptiven Lastverteilung bei einer Parallelisierung 



eines Moduls oder Rechners konnen die betrofFenen Tasks erneut verteilt werden, so dafi es keinerlei 
Riickwirkung auf die Optimierung gibt. Die Abhangigkeiten der Tasks miissen dabei beriicksichtigt 
werden. Bei einem Ausfall werden daher im Rahmen der mittelgranularen Parallelisierung erzeugte 
Tasks teilweise gestoppt und geloscht. 

Im oben genannten Beispiel gab es eine Reihe Netz-induzierter Modulausfallen. Sie sind in Bild 2 
markiert. Aufier erhohten Verwaltungs- und Wartezeiten in den betrofFenen Iterationen ergaben sich 
keine Auswirkungen auf die Optimierung. 



8. Aktuelle Anwendungen 

Die Hauptarbeiten an der MEPO wurden am Institut fiir Flugmechanik und Raumfahrttechnik der 
TU Braunschweig IFR durchgefiihrt. Wahrend der letzten Jahre wurden dort eine Reihe von ver- 
schiedenen Parameteroptimierungen mit MEPO und VorlauFerprogrammen erfolgreich abgeschlossen. 
Von den am Institut durchgefiihrten Arbeiten seinen beispielhaft die Optimierung von Aufstiegs- und 
Wiedereintrittsbahnen von Raumtransportern genannt [Ax94]. Als Optimierungsroutinen wurden 
hauptsachlich Evolutionsstrategien (ES) eingesetzt. 

Die aktuelle Anwendung von MEPO am IFR findet im Rahmen des MEGAFLOW-Projektes statt. 
Es handelt sich um ein Verbundvorhaben zwischen verschiedenen Hochschulinstituten, dem DLR 
und der Daimler Benz Aerospace Airbus zur Entwicklung eines Stromungssimulationscodes. MEPO 
wird dabei fiir die mehrfach erwahnte Optimierung der Geometrieparameter eines Tragfliigels ver- 
wendet. Ziel ist die Entwicklung eines Tragfliigels fiir das Verkehrsflugzeug A3XX. Eingesetzte Op- 
timierungsstrategien sind Gradienten- und Simplex- Verfahren sowie ES [HA96], [AHF96]. 

Ein weiteres Projekt des IFR ist die Optimierung von Konstellationen fiir moderne Satellitennaviga- 
tions- und -kommunikationssysteme in Zusammenarbeit mit der Firma Dornier. Dort wurden zum 
einen die Anzahl von benotigten Satelliten und deren Bahnen optimiert, zum anderen die Posit ionen 
von Bodenstationen zu deren Betrieb [PAD97]. Auch hierbei handelt es sich um Parameteropti- 
mierungen mit der Evolutionsstrategie. 
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Am Institut fiir Warme und BrennstofFtechnik der TU Braunschweig wird MEPO fiir verschiedene 
Optimierungen im Kraftwerksbereich eingesetzt, zum Teil im Auftrag der Firma ABB. Herauszuhe- 
ben ist die Optimierung eines Kombikraftwerkes mit einer modifizierten Evolutionsstrategie, in dem 
gleichzeitig die Anzahl und Anordnung der Kraftwerkskomponenten und deren Parameter bestimmt 
werden. [DRA97] [DGL98]. 

Ein Reihenfolge und Anordnungsproblem wird auch bei den Firmen Siemens und Studsvik-Skand- 
power mit MEPO bearbeitet. Bei der jahrlichen Revision von Kernkraftwerken werden die Brennele- 
mente des Reaktor kerns umverteilt. Ziel ist eine gleichmafiige Leistungsverteilung im Reaktor, sowie 
die Minimierung der Anzahl frischer Elemente. Als Strategie wird eine modifizierte ES verwendet, 
die mittels Heuristiken eine sehr rasche Konvergenz ermoglicht [AT95]. Wartung und Support fiir 
MEPO wird bei der Firma Scandpower sichergestellt. 

9. Zusammenfassung und Ausblick 

Mit MEPO wurde in den letzten Jahren an der TU Braunschweig ein System geschaffen, das Ent- 
wickler und Anwender der numerischen Optimierung mafigeblich unterstiitzt. Die Abbildung aller 
Grofien auf Parameter erlauben den flexiblen Einsatz von Giitefunktionen fiir verschiedene Opti- 
mierungen ohne weitere Integrations- und Anpassungsarbeiten. Die Aufteilung von Giitefunktionen 
in verschiedene Module ermoglicht die Wiederverwendung in anderen Zusammenhangen. 

Grob und mittelgranulare Parallelisierung unterteilen die Qualitatsbestimmung in eine Vielzahl 
unabhangiger Tasks. Eine adaptive Lastverteilung ermoglicht die Einbindung beliebiger weiterer 
Rechenkapazitaten. Sie sorgt mittels iterativer Zuweisung der Tasks zu den Modulen zur Erzielung 
kurzer Iterations-Zeiten. An einem Praxisbeispiel wurde die Qualitat der Verteilung gezeigt. Die 
Einsatzreife und Effizienz von MEPO zeigen sich in vielen Anwendungen an Hochschulinstituten und 
in Industrieprojekten. 

Das Institut fiir Wissenschaftliches Rechnen IWR der TU Braunschweig hat das jetzige MEPO- 
System iibernommen und entwickelt weitere Systemmodule. Hierzu gehoren etwa ein Manage- 
mentsystem zur Auswahl und Steuerung von Optimierungsstrategien, eine JAVA-basierte, graphische 
Oberfiache und eine Datenbankanbindung. Sie werden in absehbarer Zeit in MEPO integriert und 
dem Benutzer zur Verfiigung stehen. 
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A Simulation Platform for Multiagent Systems in Logistics 



Heinz Ulrich, Swiss Federal Institute of Technology, Zurich 

Summary: The challenges in today’s global economy are flexibility and fast reactions to customer 
requests. To cope with these requirements there is a need for intelligent adaptive planning and 
control systems. The multiagent approach offers favorable prospects for adequate solutions. A 
simulation platform for the development of multiagent systems in logistics is presented. Logistics 
is viewed as socio-technical which enables to take into account human resources within the design 
process explicitly according their importance. This simulation platform is based on the discrete-event 
simulation environment HIDES developed at our institute. The modeling bases of the simulation 
is a reference model of logistics comprising few specific basic elements to be used according to a 
well-defined syntactical structure. To realize multiagent systems a particular management level is 
added, where the agents and the interfaces to human resources are located. The platform is intended 
to provide a substantial support for developing multiagent systems. 



1. Introduction 

The rapid development of economy and technology of the recent years influenced logistics manage- 
ment fundamentally. On the one hand problem dimension and complexity increased on the other 
hand the potential for problem solving increased as well if we consider the progress in Information 
technology as to its performance in data-processing and communication. 

In economy the changes are caused by a growing globalization of the markets and an increasing ex- 
tension of the business network which created new challenges for logistics management. At the same 
time information technology, partly a catalyst in economic development, provided new facilities for 
communication and data processing which open new perspectives for an efficient decision support in 
logistics management. The mentioning of Internet and the impressive performance of Deep Blue in 
competition with the world champion in Chess may suffice to illustrate the inherent potential. 

2. New Characterization of Logistics Problems 

Dealing with logistics problems, it is fairly obvious, that in a certain level of abstraction we basi- 
cally are always meeting similar problems, be it transportation, production or inventory systems. 
Under the new circumstances the general abstraction of logistics problems could be characterized as 
follows: We distinguish between the system and its environment. The environment defines targets 
for the output of the system consisting in quantitative demand and delivery performance required. 
The system itself can be characterized as distributed system which means, system elements ( e.g. 
resources, facilities) are locally distributed over a large area. But the distribution not only concerns 
resources and facilities, management and information processing is also distributed within the sys- 
tem. Dynamics within the system is expressed by processes designed to accomplish the given tasks. 
They interact and share the same scarce resources. Logistics management consist in directing the 
system’s processes in a .coordinated way to fulfill the given targets in the best possible way. The 
system is to view as socio-technical system, besides automated data processing human resources are 
involved directly in the decision process. 
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3. Today’s Requirements 

The tasks within Logistics Management are focused on planning and control, today’s requirements 
may be summarized as follows. 

An efficient use of scarce resources alone doesn’t suffice any longer, dynamic aspects are the longer 
the more decisive. High flexibility and fast reactions to internal and external changes are important 
qualities. As to the design of computer supported planning and control systems, adaptability is 
required which means, planning and control directives have to be permanently updated to guarantee 
a remaining on track with the development of the system’s environment. This concerns not only 
the system state, but also the knowledge about the future, expressed in forecasts. The quality of 
the computer support in the decision process should be intelligent Considering today’s potential in 
information technology, a support should provide more than data administration, an active direction 
towards goal fulfllling solutions is expected which implies a certain degree of intelligence. Not only 
the environment is changing fast, the system itself is subject of changes. A system design is therefore 
never finished, future changes and extensions have to be easy to implement, the system has to be 
extendible. 

4. Multiagent Systems 

Planning and control systems meeting today’s requirements have to be conceived therefore as in- 
telligent distributed systems. The artificial intelligence community proposes agent based concepts 
which seem to be promising in our context. Agents are software artifacts which are able to act. au- 
tonomously without assistance of the user. They react to system states and pursue defined goals. In 
a distributed environment a community of agents are introduced, the system is then called multiagent 
system. 

4.1 The Agent Concept 

For our purposes we use the term agent in its the weak notion according to M. Wooldrige and N.R. 
Jennings [1] : 

Definition: 

An agent is an entity in a software-based computer system with the following properties: 

• autonomy: agents operate without the direct intervention of humans or others, and have some 
kind of control over their actions and internal state; 

• social ability: agents interact with other agents (and possibly humans) via some kind of agent- 
communication language; 

• reactivity: agents perceive their environment (which may be the physical world or the model), and 
respond in a timely fashion to changes that occur in it; 

• pro-activness: agents do not simply act in response to their environment, they are able to 
exhibit goal-directed behavior by taking the initiative. 
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These agents need a quality we usually call intelligence. The exact definition of intelligence is still 
controversial, the following limited definition seem to be useful in our context: 

We distinguish between the following two degrees of intelligence: 

Reactive intelligence bases on knowledge which is represented in rules or in decision trees. The solu- 
tion process provides an adequate reaction to the actual situation and consists of a search procedure 
on the individual knowledge base. Reactive Intelligence comprises all the well known-conventional 
automation where an apparatus reacts to input signals according to well-defined rules The possible 
outcome is selected among a number of predefined actions, (e.g. finite state automata) 

Cognitive intelligence includes reactive intelligence but in addition more demanding problem solving 
procedures are at hand A base of methods offers a selection of methods ready for use. These methods 
can be applied in a problem solving procedure where solutions are generated in view of explicit goals, 
valued according to relevant criteria to choose finally the best one for realization. 

Corresponding to the above definition of intelligence we define agents with different levels of intelli- 
gence: 

The reactive agent 

Reactive agents are provided with an intelligence of reactive quality, that is they act using a stim- 
ulus/response type of behavior and are able to respond to the present state of the environment in 
which they are embedded. They do not have a representation of their environment in their knowledge 
base. 

The cognitive agent 

Cognitive agents are provided with an intelligence of cognitive quality, that is they are driven by 
intentions, i.e. by explicit goals that conduct their behavior and make them able to choose between 
possible actions. They have a symbolic and explicit representation of their environment in their 
knowledge base on which they can reason and from which they can predict future events. 

4.2 Internal Structure of an Agent 

In agent’s design we propose a general internal structure for each agent containing the following 
elements: 

Characterization: 

The characterization comprises individual name and localization within the system. 

Intelligence degree: 

The competence of an agent as to its intelligence is indicated by his association to one of the two 
agent types (cognitive or reactive). 

Position: 

Each agent has a well-defined position in the management hierarchy level within the agent community 
and the communication links with other agents are determined. 

Information base: 

An agent has a individual information base which can be extended by the defined communication 
links. The information base contains knowledge, methods and information about the system state 
within the running process. 
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Task: 

Each agent has well-defined responsibilities and herewith related specific tasks he has to complete 
alone or in cooperation with other agents. 

Goals: 

Related to his responsibilities an agent has his given goals he is able to adapt to operational goals 
for his specifique tasks. The coordination of the individual goals of the agents is performed within 
the management process and is a crucial problem of its design. 

Capabilities: 

For problem solving each agent has his individual problem solving engine designed to solve his 
individual tasks. His capabilities are in correspondence with his available degree of intelligence. In 
case of a^iooperative problem solving each agent may use his engine in a problem solving cycle several 
times, until the common solution promises satisfactory results. 

4.3 Problem Solving Engine 

Each agent is able to solve problems independently. The proceeding for their problem solving is called 
engine. A problem solving engine is designed according to a general problem solving proceeding in 
management science e.g. Systems Engineering [2] . The following cycle for problem solving is 
proposed: 

1. Problem definition: Analysis of the present state in search for problems which need to be 

solved. 

2. Search for goals: Among the given goal or goal hierarchy choosing the most adequate 

operational goal to be addressed in the above identified problem. 

3. Search for solutions: Based on the available potential on knowledge and methods possible 

solutions are generated. 

4 . Final decision: The generated solutions are valued according to criteria corresponding 

the operational goal determined in step 1. 

In case of the lower intelligence level of the reactive type, the last two steps are reduced to one single 
step, a simple search procedure. 

4.4 Agent Types 

In our view logistics systems are sociotechnical systems. Especially in logistics management human 
resources are involved in various functions and responsibilities within the decision process, this in- 
terference has to be taken into account in the modeling environment. We therefore conceived the 
following two types of agents: 

The control agents have to accomplish management tasks corresponding to those in a human envi- 
ronment. Therefore, capacities for problem solving are required which include in their turn a certain 
degree of intelligence. These agents dispose of general procedures for problem solving, the so-called 
‘engines’, which have to be specified individually for each application case. 
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The interface agents establish the necessary interface for the interaction with human resources. They 
are built up according to a general scheme as the control agents with individual specification accord- 
ing to their specific task. They don’t have any direct control duties, these functions are delegated 
to the control agents, therefore they lack the ability for problem solving. Their duty consists in es- 
tablishing the necessary communication links and in procuring the required information concerning 
the actual state of the system. By means of these interface agents a human operator or manager 
is able to interact within the running process, she/he can stop the simulation run, insert her/his 
instructions and continue the run afterwards. 

4.5 Agent Communities 

In a multiagent system several agents form a community which has to be organized. We suggest 
a structure following as close as possible human management organizations as guidelines with their 
internal communication and hierarchy. These agents have each their specific planning and control 
task which they fulfill together in a well defined and cooperative way. 

5. Architecture of a Simulation Platform for Multiagent Systems 

In logistics management system design is a process which proceeds from concept to realization in 
several steps including iterations if required. Before final implemen- tation dynamic aspects of the 
system can only be tested in a model by simulation in an experimental environment. The final 
validation of a concept for practice will be given by its success in application. 

Simulation is therefore an important method applicable in various contexts [3]. The development 
of simulation models is a very time-consuming job, especially the necessary frequent adaptation of 
the models within a dynamic development process demands a well-designed fiexible model base to 
perform this task efficiently. That is the reason why we try to define an architecture for a simulation 
platform which is able to support the design process for multiagent systems in logistics substantially. 

5.1 Reference Model 

The architecture of a simulation platform in a specific application field should be based an a meta 
model of this environment, so called “reference models” as Schmidt [4] proposes. As reference model 
for logistics systems we propose the following abstraction: 

We define three levels a physical, a logical and a management level. This distinction allows a sepa- 
ration of the physical part of the system from its planning and control part. As we focus in logistics 
management on planning and control, the physical level often remains unchanged whereas model 
modifications are necessary only on the other levels. By this separation, the model design process is 
much more transparent herewith. On the physical level the static structure of the system and the 
dynamic sequence of the physical events are represented. The relationships are viewed as a directed 
graph with two types of nodes, stations and queues. Other important model elements are resources 
introduced as abstract elements. They can be allocated dynamically and dispose of information 
about their availability and their behavior. Dynamic elements, moving in the graph, are introduced 
as entities, which are characterized and identified individually. In the real world, logistics systems 
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are in general very large and very complex. To perform the modeling task efficiently, a top-down 
approach is indicated with a stepwise increase in the degree of detail. As a consequence, we get a 
model in different hierarchical abstraction levels. To support such a proceeding in modeling, the 
building of subnets is provided.. In a network containing various subnets on different hierarchical 
levels a strictly logically consistent hierarchical structure is enforced. A subnet is always incorporated 
in a station in which the subnet substitutes the logic element of the logical level. 

The logical level comprises the definition of the processes to be executed on the physical level, the 
necessary information flow to fulfill this purpose and their logical control. The nodes on this level 
are logic elements each in charge with the triggering of one specific station. They control the fiow 
process in a decentralized way herewith. The entities arriving at the associated station are handled 
by the logic element according to a general default proceeding or an individually defined protocol 
containing activities as to generate, duplicate, split, modify and remove the arriving entities. In 
addition any algorithmically definable control logic can be implemented. The information is stored 
partly globally, partly locally in stations and entities. Whereas the global information is globally 
available, the locally stored information is only available in the local environment. An extension of a 
local information base is possible by defining explicitly Information fiow channels to supplementary 
information sources.. The local information basis for a logic element can be extended herewith. 

The management level is an extension of the logical level and is used to introduce our agent commu- 
nity. For a planning and control process needing more then only static decision rules a multiagent 
approach is indicated. The agents have to perform the automated part of the management tasks. 
This extended level provides the basis for developing the coordination process among the agents, 
which formally regulates competence and responsibility of the individual agents. Agents may inter- 
fere on the logical level in the triggering process of a station or directly to the information base of 
moving entities or resources. 

To enable the interaction of human resources into the management process as it will happen in reality 
we introduced a special type of agents providing the required interface functions, the interface agents. 
Our multiagent system disposes therefore of two types of agents. The control agents destined for 
executing the automated control tasks and the interface agents for establishing the necessary inter- 
face for the interaction with human resources. Herewith a large field for interesting studies is opened 
concerning the investigation of the distribution of tasks in logistics management between technical 
system and human resources. 

A graphical representation of reference model for a multiagent system in logistics is shown in Figure 1. 



5.2 System Control 

Discrete event simulation is the most appropriate method for representing a decision oriented dy- 
namic process as we have to deal with in logistics management. Process control is performed by 
an event list containing events and their due date. The events are sorted after their due dates and 
executed in the determined time sequence. In a multiagent system this simple procedure is not suf- 
ficient. Events are not only generated as a consequence of other events, agents with there proactive 




267 



Architecture of Multiagent Systems 




Figure 1: Architecture of multiagent systems 

behavior are independent actors in the system generating additional events. System control in a 
multiagent system is therefore a more demanding task which has to include the synchronization of 
agent’s activities as to simultaneousness and clearing up of contradictions in actions. 

We suggest the following approach for system control in a multiagent system which corresponds 
essentially to an approach Eschenbacher [5] proposes for dealing with simultaneous events in a con- 
ventional event list: 

As in discrete-event simulation we have a sequential processing of the event list. Each agent enters 
his next proactive action in the event list. The event list is processed as follows: 

• Consolidation of the present system state (updating) 

• Cyclic call to all the agents for reaction and release of the responding actions. 

• Clearing up of conflicts resulting out of the initiated actions beforehand. 

• Consolidation of the new system state (unique definition of the result) 

• Proceeding to the next time in the event list and repeating again the same procedure 

5.3 The Simulation Platform HIDES 

At our institute the simulation platform HIDES (Highly Interactive Discrete Event Simulator, cf. 
Graber et al. 1993, [6], Ulrich et al. 1994, [7]) was developed for the simulation of logistic systems. 
HIDES is a general purpose simulation framework which is designed for event-oriented simulation, 
and implemented in an object- oriented environment. It is based on the above described reference 
model but comprises until now only the first two levels, the physical and the logical one. 

As we have the source code of the simulation platform available for our purposes we intend to realize 
the above described extension of our simulation platform in a stepwise procedure. For specific ap- 
plications we add the supplementary management level closely connected to the practical projects. 
After several specific realizations we will try to identify the best suitable general structure for the 
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management level, in particular we will try to provide a general shell structure for agents in order to 
facilitate substantially the design process of new systems. 



6. Conclusions 

To cope with today’s requirements in economy and industry an efficient management of logistics on 
all levels is essential. The use of simulation in the development process for an adequate decision sup- 
port in logistics management allows to test various solution approaches before their realization. The 
better a simulation platform corresponds to their application environment the easier the development 
process will be. As the multiagent approach promises to be successful in logistics management, we 
propose an architecture for a simulation platform which includes a management level for introducing 
agents. The design of agents is intended to be supported by a general shell of an agent which has to 
be completed individually according to their specific purpose and duties. As the multiagent approach 
up to now is only realized in very simple examples on relatively low levels of intelligence, the benefit 
of this architecture will be increasing in future with the increasing potential and quality of multiagent 
solutions. 
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Summary: We consider nonparametric generalization of various well-known financial time series 
models and study estimates of the trend and volatility functions and forecasts based on kernel 
smoothers as well as on neural networks. 

1. A nonparametric Black-Scholes model in discrete time 

During the last decade, a lot of parametric models for financial time series have been studied. To 
decide which of these models is appropriate for a particular data set, nonparametric methods may 
be used analogously to the situation in regression analysis (Hardle, 1990). Using the latter, one may 
derive confidence bands for trend and volatility functions, develop tests of particular parametric hy- 
potheses against quite general alternatives or construct nonparametric forecasts which do not depend 
on specific and perhaps restrictive assumptions on the structure of the time series. We illustrate this 
approach with three examples, starting from a rather straightforward nonparametric ARCH-model 
which may also be interpreted as a generalization of the Black-Scholes model in discrete time, then 
proceed to a nonparametric stochastic volatility model and finally consider forecasting a autoregres- 
sive time series with exogeneous components using a neural network-based approach for estimating 
the prediction function nonparametrically. 

The Black-Scholes approach to option pricing is based on modelling the price s{t) of the underlying 
asset at real- valued time t as a geometric Wiener process, i.e. as a solution of the stochastic differential 
equation 

ds{t) = /i s{t)dt -I- a s{t)dw(t) 

where, w{t) is a standard Wiener process, and the unknown parameters /i and a are interpreted as 
the expected return per time unit and as the volatility of the price process. A discrete-time version 
of this model is (compare, e.g., Hull, 1997, ch. 10.4) 

St+i = (1 + lj)St + cr StSt+i (1) 

where St, t integer, are independent standard normal random variables. Over a reasonable period 
of time, this type of model provides a useful approximation to, e.g., stock prices or foreign exchange 
rates. However, under particular circumstances, assumptions like a constant volatility independent 
of time and price level are violated. We therefore consider a generalization of (1): 

St+i = m{St) + (j(St)£t+i (2) 

where St are independent identically distributed (i.i.d.) random variables with mean 0 and variance 
1. m(a;), a{x) are arbitrary functions up to some regularity assumptions. For the particular choice 

m{x) = 0 and cr^(x) = a; + ax^ , 
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(2) becomes an ARCH(l)-process (Engle, 1982), the ancestor of the popular ARCH-GARCH-family 
of financial time series models (Bollerslev et al., 1992). (2), therefore, may be seen as a nonparamet- 
ric generalization of an ARCH(l)-model, too. 

Nonpar ametric estimates of the trend function m and the volatility function a have been the topic 
of many papers starting with the work of Robinson (1983). A popular choice of estimates are 
kernel estimates constructed in an analogous manner to the Nadar aya- Wat son estimates in regression 
analysis (Hardle, 1990) from a sample Sq,. St • 

m{x, h) = ^Y,K St/Pixy h) (3) 

where a kernel estimate of the stationary marginal density p(x) of the St is used: 

».>-) = ^ p(^)' W 

K(u) is a given kernel function satisfying some standard regularity assumptions, e.g.: 

(Al) 0 < K(u) < Ck for some constant Ck, J uK{u)du = 0, / u^K(u)du < cx), / K{u)du = 1. 

The bandwidth h controls the smoothness of the function estimates m, a, p. For simplicity of nota- 
tion, we have chosen the same bandwidth for all three estimates. From a theoretical viewpoint, this 
is not necessarily so, and in applications, band widths for estimating trend and volatility may differ. 

The asymptotic properties of the estimates m, a and p are now well understood. Robinson (1983) has 
shown asymptotic normality, and Collomb and Hardle (1986) have proven uniform strong consistency. 
Asymptotic expansions of bias and variance have been derived by Auestad and Tjostheim (1990) and 
Masry and Tjostheim (1994). However, asymptotic normality provides reasonable approximations 
to, e.g., the distribution of m only for quite large sample sizes as has been illustrated by Franke 
and Wendel (1992) in a simulation study. For practical purposes like calculating confidence bands 
or testing parametric hypotheses against nonparametric alternatives, other approximations are of 
interest like, e.g., the bootstrap which, since Efron’s (1979) seminal paper, has become a standard 
tool in a statistical analysis. In nonparametric situations, however, the bootstrap has to be designed 
with some care as straightforward applications of the general ideas do not always work. 

For kernel estimates of the autoregression function m and the volatility function cr, various bootstrap 
procedures have been studied recently by Franke et al. (1997). For sake of illustration, we consider 
only one of them, and apply it to get confidence bands for m and a based on some stock index series. 
To begin with, we impose some assumptions on the model (2): 

(A2) m and cr are Lipschitz continuous functions with Lipschitz constants Lm^L^ resp. 

(A3) ctq = inUeB. c^{x) > 0. 
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(A4) The innovations St are i.i.d. random variables with mean 0, variance 1 and probability density 
Pe satisfying inf^gcPeC^) > 0 compact sets C. 

(A5) Lm + La S\si\ < 1. 

These assumptions are stronger than what is really needed but, in this form, easy to formulate. 
They guarantee that there exists a stationary and geometric ergodic solution 5t, —oo < t < oo, of 
(2). Geometric ergodicity implies, in particular, the strong or a-mixing property with exponentially 
decreasing mixing coefficients. Therefore, the time series {St} shares many properties with sequences 
of i.i.d. random variables like a central limit theorem for the mean, Bernstein inequalities etc. 



If a = 1, (2) is a nonlinear autoregression of order 1, and for general a it also defines a Markov 
process. Therefore, we mimick the familiar residual-based bootstrap for autoregressions (compare, 
e.g., Kreifi and Franke, 1992.). As a first step, we calculate sample residuals 

. ^ St-m{St-ug) 
a(St-ug) 



i.e. we solve (2) for St and replace the unknown functions m, a by some initial estimates of kernel 
type, but with arbitrary bandwidth g. Some of these sample residuals are rather crude approximations 
of their true counterpart £t. m(a:, g) is formed by a local smoothing operation and, therefore, is an 
unreliable estimate of m(x) if only few data Sk, 1 < ^ < T, happen to be close to x. Due to our 
above assumptions, with large probability this will only happen if |a:| > 7 t for some constant 7 t 
depending on T. Therefore, we retain only the reliable sample residuals ft, t € A, where A = {t = 
1, . . . , T; \St-i\ < 7t}- Then, the remaining residuals are centered: 






such that the empirical distribution Ft of the ft? ^ € A, has mean 0. For theoretical purposes only, 
we have to smooth Ft by convoluting it with Hb(u) = \H{^) where H is & probability density with 
mean 0 and variance 1, e.g. the standard normal density. The smoothed empirical law = Ft* H b 
then has a density. Now, we draw i.i.d. bootstrap residuals ej, t = 1, . . . ,T, from the distribution 
Ft, 6, and we get the bootstrap sample 5*, . . . , by 



s: = m{sU9) + »{sUgyt 






with, for sake of simplicity, Sq = Sq. Now, we can calculate bootstrap versions m*(x,h), a*{x,h) 
and p*{x, h) by replacing the original data in (3) - (5) by the bootstrap resample . . . , 5^. 



It can be shown, that this type of bootstrap works in the sense that the distributions of the adequately 
centered and scaled estimates m*(x, /i), a*{x, h) in the bootstrap world provide good approximations 
of the distributions of m{x, h) and (j(x, h) in the real world. To formulate a corresponding theorem 
we need some technical assumptions: 

(A6) m,a,Pe,H are twice continuously differentiable with bounded derivatives. Se\ < oo, and 
suPxeR k Pi(^)l < 00 . / v"^H{v)dv < oo, / v^\H\v)\dv < oo. K has compact support [-1,+1], is 
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symmetric and three times continuously differentiable. 

(A7) 6 0 and gj\P ->• 0 as T oo, 7 r oo, 7 r/logr is bounded and inf |x| < 27 ^ 7 ^ 0 Pe (a:) > 

(plogT)^ as T ^ 00 . 



Theorem 1 (Franke et al., 1997): Assume (Al)-(A7). Let g,h 0 for T 00 such that 
Th^ — > c>0 and g ~ T~® with 0 < a < ^. Then, for all x elR: 

dK{VTh{m*{x,h) — m{xjg)}j ^/Th{m(x,h) — m{x)}) — > 0 

p 

dj<'(v^Th{(7*^(x, /i) — a^(x,p)}, y/Th{a^(x,h) — cr^(x)}) — 0 

p 

Here, for real random variables X, Y with distribution functions F, G resp., 
dK{X, Y) = dK{F, G) = sup„^R |F(u) - G{u)\ 
denotes the Kolmogorov distance between the distributions of X and Y, 





To illustrate the performance of the bootstrap, Figures 1 and 2 show the results of a small Monte 
Carlo study for a time series St of type ( 2 ) with linear trend m{x) — 0.7x , ARCH-like volatility 
cr^(x) = 0.01 + O.Sx^ and standard normal innovations St . Figure la shows the true function m, 
a kernel estimate m(x, h) with h = 0.05 and a set of 99%-confidence intervals for m{x) for an 
equidistant grid of values of x , joined to form a pointwise confidence band. The latter is based 
on a Monte Carlo-simulation of size 500. Figure lb shows again the true 99%-confidence band of 
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Figure la together with a bootstrap approximation based on the same sample as the kernel estimate 
of Figure la. For calculating the bootstrap band, 500 resamples were used where as the bandwidth 
of the reference kernel estimate g = 0.06 and as the cut-off point 7 t = 10 log T have been chosen. 
Figure 2 shows the same type of picture for instead of m using the same bandwidths h and g . In 
both cases, the bootstrap confidence bands approximate the true confidence quite well, in particular 
in the range of values x close to 0 where the process St stays most of the time. 

2. General nonparametric stochastic volatility models 

A common approach to improve the Black-Scholes model allows the volatility to be not only time- 
dependent or - as in chapter 1 above - a function of past prices but to be a stochastic process a{t) 
in its own right. Various stochastic volatility models have been proposed which differ with respect 
to the stochastic differential equation prescribing the evolution of the volatility process. To apply 
these models some parameters have to be estimated, and for this purpose time-discrete versions are 
considered. As a result of such a discretization process starting from a continuous-time stochastic 
volatility model investigated by Scott (1987) and Wiggins (1987), Taylor (1994) studies the so-called 
lagged autoregressive random variance (LARV) model for the return Rt = log{St/ St-i) of an asset 
with price St at time t: 



Rt = fi-}- CFt-l£t 

log at = cj-h P{\og (Tt-i -u} + 'dei 

If we assume, for sake of simplicity, that the mean return /x is known and if we use the abbreviations 

2 

»?« = 2 

— logcTt and it = ^6^, 

where Hri is chosen such that £774 = 0 , the LARV model can be written as 

= 6-1 + it = ^(ft-i) + (6) 

with m{u) = a; + ^{u — a;}. For general function m instead, ( 6 ) models the log- volatility as a non- 

parametric autoregression of order 1 as discussed in chapter 1. However, ft is not observed directly. 
For estimating m, only the transformed returns Xt are available. ( 6 ) formulates the relation between 
observables Xt and non-observables ft as an errors-in-variables model, and as a common fact related 
to such situations, we have to assume that the distribution of the ’’errors” rjt is known to have any 
chance to infer structural properties of the process {ft} from the observation {Xt}. If we retain the 
normality assumption of the LARV for the et then the corresponding rjt would be distributed like 
I times the logarithm of a xf-random variable centered such that ST)t = 0. As in chapter 1 , we 
impose some assumptions on m and on the laws of the Su T]t which, among other things, guarantee 
the existence of stationary solution of ( 6 ): 

(Bl) m is twice continuously differentiable 
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(B2) For some > 0 are i.i.d. jV'(0, i9^)-random variables, and there exist i.i.d. JV'(0, l)-random 
variables St such that rjt = |loge^ — /x^ and are i.i.d. bivariate Gaussian variables with 

corr(£:f,£t) = p for some \p\ < 1. 



As an immediate consequence of (Bl), (B2), the stationary density of the log volatilities ft is twice 
continuously differentiable. If the ft would be i.i.d. themselves, it would be possible to estimate p^ 
nonparametrically from observations Xi,...,Xt by a so-called deconvolution density estimate as 
described by Carroll and Hall (1988). The basic idea is the following: If fi, . . . ,fr would be known, 
we could estimate p^ directly by a kernel estimate as in (5). By (6), the density p^ of the Xt is the 
convolution of p^ and the known density Pn of the pt • Px = * Prr Now, we estimate the Fourier 

transform (j)x of px from Xi, . . . , Xt by the sample characteristic function 

4(w) = ^ 

^ t=l 

Then, we deconvolute, i.e. in the Fourier domain we divide by the known Fourier transform 0,^ of 
Prf, and, simultaneously, we smooth which, in the Fourier domain, becomes a multiplication with 
the scaled Fourier transform (j)K of some kernel function K. Finally, we apply an inverse Fourier 
transformation and get the desired estimate p(x, h) of p^{x) : 






( 7 ) 



This can be rewritten as a common kernel density estimate 

«*.<•) = A |«*(^) <»' 

depending on the observables Xt only. The kernel Kh depends on the bandwidth h explicitly: 

= ( 9 ) 

By analogy. Fan and Truong (1993) have proposed deconvolution kernel estimates for regression 
functions by replacing the fixed kernel K in the Nadar aya- Wat son estimate by the variable kernel 
Kh of (9). Their approach, transferred to the problem of estimating the autoregressive function m 
of (6), provides the following estimate based on data Xq, . . . , Xt 



77i(x, h) = ^ 



( 10 ) 



where Kh and p are given by (9), (8). It can be shown that m is really a consistent estimate of m : 



Theorem 2 (Franke et al., 1998): Let the Fourier transform (f>K of K have a compact support [— /c, k], 
assume (Bl), (B2), and let h converge to 0 as c/logT for some c > «7t/2. Then, 

m{x, h) = m{x) -h Op((logT)“^). 



The slow speed of convergence is typical for such nonparametric errors-in- variables models and is not 
caused by the time series structure we have assumed. It may be improved if other laws of the et are 
assumed instead of the Gaussian (compare Fan and Truong, 1993). 
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3. Neural network based time series models 

One drawback of the kernel estimates of the previous chapters is their practical limitation to low- 
dimensional situations. A straightforward generalization of (2) to higher-order models like 

St+i = St-i, St-r) + cr(5't, St-li St-r)St+l (11) 

can be handled with kernel estimates only for small r or under specific restrictions of the func- 
tions m and a. An example of the latter are the generalized additive models where m(xo, ...,Xr) = 
^ 0 ( 3 ^ 0 ) + •• + rrir{xr) and only one-dimensional functions have to be estimated nonparametrically. 

An approach which works quite well in practice and additionally allows for including exogeneous 
variables is based on neural networks which within the last decade have become a popular tool for 
solving classification, regression and forecasting problems in particular in finance (Anders, 1997). To 
give a flavour of basic ideas and typical theoretical results we first consider a regression model: 



Yt — TTl[Xt) + Et 



( 12 ) 



where we assume 

(Cl) Ai, A 2 , ... are i.i.d. d-dimensional random vectors with density p{x). £i,E 2 , ... are independent 
real random variables and S{et\Xt = x} = 0 , S{e^\Xt = x} = (t^{x) < 00 . There exist constants Cn 
such that S{\£t\'^\Xt = x} < Cn < 00 is satisfied for all x,n. 

(C2) m is continuous and bounded, cr^(x) is continuous and 0 < (5 < CTg(x) < A < 00 for all x. 

We consider only feedforward neural networks with one hidden layer for estimating the function m. 
The basic building block is the so-called neuron which provides a nonlinear transformation of a linear 
combination of the inputs x = (xi, ..., x^)' :x 'ip{uo-\-uiXi-}~ ...u^Xd) , where is the fixed activation 
function, e.g. a function of sigmoid form like the logistic function 

i>{s) = 

If we combine H of these neurons with, of course, differing weights Uk , we get as the function 
represented by the network 

H 

=vo + J2 Vhipix'wh) 

h=l 

where = {w [, . . . , uq, . . . , vh)' is the vector of network weights with w'^ = (woh , . . . , Wdh), h = 
1, . . . , if, and x' = (1, x') is the input vector augmented by a bias component 1. The parametrization 
of the network function is not unique, as certain simple symmetry operations applied to the weight 
vectors obviously do not change the value of //f(x,ii). To avoid this ambiguity we restrict d to some 
large compact set ©h where we do not have problems with identifiability. 

Network functions like /h(x,i 9) are universal approximators, i.e., for if -> 00 , they are able to 
approximate quite general functions m arbitrarily well (Hornik et al, 1989). Using a sample of noisy 
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observations (Xi, Ki), V at) from the model (12), for given H a network function is fitted to 
the data replacing the unknown by its nonlinear least squares estimate i e. the solution of 

Dn{^) = jf - MXt, t?))^ = min ! 

For N 00, converges to t?o, the parameter vector for which the corresponding network function 
fnixj'd) provides the best approximation of the true regression function m , i.e. the solution of 

= J [(m(x) - }h(x, ■d)f + a1{x)]p{x)dx = min ! 

To show that is even asymptotically Gaussian, we impose some rather weak regularity conditions 
on the network. Here, V and denote the gradient and the Hessian with respect to d. 

(C3) The activation function V’ is bounded and twice continuously differentiable with bounded 
derivatives. 

(C4) Dq{'&) has a unique global minimum at lying in the interior of 0^, and V^Do(i^o) is positive 
definite. 

To formulate the theorem, we introduce the sample version of Dq{'&) : 

^ E [("»(^‘) - . 

and we define as the minimizer of Instead of we consider its two components 

Sn — '<^N and 'dN — “do separately which asymptotically are independent. 

Theorem 3 (Pranke and Neumann, 1998): Suppose that (Cl) - (C4) are satisfied. Then, for N oo, 

i.e. y/N(dN ~ '^n) and y/N{'dN ~ ^o) are asymptotically independent normal random vectors with 
covariance matrices Ei and E 2 , respectively, where 

El = A{dor^Bi{do)A{do)-\ E 2 = A{%)-^B2{^o)A{do)-^ 
with Hi(i9) = 4 • j <7g(x)V fnix, i9) • V/^(x, i9yp(x)dx 

B 2 W = 4 ■ I (m(x) - /ff(x,t^))^V/j,(x,1^) ■ VMx,^)'p(x)dx 

and A(t?) = V^oW- 



As an immediate consequence, y/N^d^ — 1 ^ 0 ) is asymptotically normal with mean 0 and covariance 
matrix Ei 4 * E2. Remark that in the correctly specified case, where m(x) = E2 is equal to 

the zero matrix, as then 
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As for the kernel estimates of chapter 1, asymptotic normality may be improved by the bootstrap 
where due to the heteroskedasticity of the errors et of (12) the wild bootstrap has to be used (Franke 
and Neumann, 1998). In a series of papers, White (1989a, b; 1990) has not only shown asymptotic 
normality of §n for given H and the dependence of the asymptotic covariance matrix of \/N {'dN — 'do) 
on the numerical procedure used for calculating the nonlinear least squares estimate, but he has also 
shown that the neural network function estimate of m is consistent if the number H of 

neurons in the hidden layer increases with the sample size N with an appropriate rate. White does not 
only discuss the regression case with independent data (Xi, Vi), ..., (X^v, Yn), but he allows {Xt, Yt) to 
be generated by a stationary stochastic process satisfying appropriate mixing conditions. Therefore, 
an analogon of Theorem 3 holds in the time series context too under suitable conditions, e.g. for 
estimating the conditional expectation of St+i given the past in a model like 

St+i = m(5t, iSt-i, ..., 5t-T, Xt) -h £t+i (13) 

where St is a real valued time series to be predicted, Xt is a d-dimensional time series of exogeneous 
variables and suitable transformations of past data Sk^k <t, and the et are independent real random 
variables. 

To illustrate the performance of neural network estimates for the autoregression function m of (13), 
we consider as an example the forecasting of stock prices. The data are daily closing prices of 28 
major Dutch stocks which dominate the CBS index for the years 1993 - 1996. We use the data up 
to the end of 1995 for preliminary model selection, i.e. for deciding on the number H of neurons and 
on the number and type of inputs used for forecasting, and for estimation the model parameters. 
The 1996 data are used for the validation part of the procedure allowing to test the performance of 
various models on data not used for model building and parameter estimation. Additionally to the 
raw stock prices, we implement various transformations like moving averages, envelopes or average 
directional movement indicators commonly used in technical market analysis (Muller and Nietzer, 
1993, Welcker, 1994) as well as several intermarket data like index prices, foreign exchange rates, 
international interest rates or the MG base metal price as inputs in our models. They correspond to 
the exogeneous variables Xt in (13). 

The given data are used to design a procedure for calculating 3-months- (60 trading days-) forecasts 
of the stock prices which serve as the basis for developping a trading strategy. Point forecasts of 
time series over such a large lag are notoriously unreliable. Therefore, we concentrated our efforts on 
predicting the trend correctly, i.e. if the stock price will increase significantly (i.e. move upwards by 
more than 5% of its original value), decrease significantly (i.e. move downwards by more than 5% of 
its original value), or stay essentially at the same level (i.e. does not change by more than 5%). 

We tested various neural network functions fniSu St-iy St-r^ ^) as approximations of m in (13) 

where the inputs were selected using knowledge of experts and statistical model selection procedures. 
As the model showing the best performance we finally decided for a feedforward network with H = 3 
neurons using d = 25 input variables (stock prices, technical indicators and intermarket data) out 
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of more than 60 candidates. Including a constant input, this model uses 82 network weights. The 
corresponding network function fn could probably be simplified if some of the network connections 
between input layer and hidden layer would be eliminated, i.e. some of the weights wjh are set to 0 
in advance. As the software used during this project did not allow for fitting incompletely connected 
networks we did not test this option. The least-squares estimates 'On of these parameters 'd were 
calculated using Jacobs enhanced backpropagation (Jacobs, 1988). Details of the model building 
including comparisons of various numerical least-squares procedures are discussed in (Decker, 1998). 

To test the performance of the forecasting procedure we predicted the price trend of all the 28 stocks 
at the end of each quarter of 1996. The results are given in the following table. The second column 
shows the percentage of correct classifications of the future price trends. As we allow for three dif- 
ferent qualitative predictions, a purely random forecast would have a chance of 33% to be correct. 
The neural network does better but the performance is not outstanding considering the number of 
misclassifications. However, the chance to predict a significant increase of the price correctly is usu- 
ally much larger. This is refiected by looking at the performance of a network buy and hold trading 
strategy: buy those stocks whose prices are predicted to rise significantly, hold them up to the end of 
the 60-days prediction interval and sell them afterwards. This strategy is compared to a index based 
buy and hold strategy where all 28 of the CBS stocks considered in the study are bought and held for 
60 days. The return of both trading schemes is given in columns 3 and 4 of the table. The network 
strategy outperforms the index benchmark considerable in each quarter. This is quite remarkable, 
as, except for the second quarter, the Dutch stock market experienced a period of rapid growth in 
1996, and it is commonly acknowledged by practitioners that it is not easy to beat the index under 
those circumstances. 



time 


prob. of correct forecast 


return network 


return index 


1. quarter 


43.30% 


15.41% 


11.22% 


2. quarter 


46.82% 


2.38% 


-0.34% 


3. quarter 


63.42% 


6.94% 


5.75% 


4. quarter 


55.05% 


13.17% 


11.84% 


annual average 


52.46% 


9.76% 


7.22% 



It is worth mentioning that all of the 28 stock price time series considered were forecasted using ex- 
actly the same neural network, and the weights were estimated using information from all 28 stocks 
simultaneously. Finally, we checked the stability of the forecasting procedure if new information 
becomes available. Our original network uses weights estimated from information available end of 
1995. Now, we added successively the data from January 1996 up to June 1996 and reestimated 
the prediction parameters based on each of those growing data sets, getting 6 new networks. We 
used each of those networks to forecast the price trend in the last quarter of 1996. The following 
table shows again the percentage of correct forecasts and the returns of the network trading strategy 
compared to the index-based strategy for the last quarter. The first row of the table repeats the 
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performance of the original network. Looking at the table, there is no systematic improvement in the 
performance of the network if additional information is included for parameter estimation. There- 
fore, it does not seem to be necessary to redesign the network frequently; the forecast procedure may 
be used unchanged for some time which is of some importance in practice as the construction of a 
suitable prediction network requires a considerable effort. 



data up to end of 


prob. of correct forecast 


return network 


return index 


12/95 


55.05% 


13.17% 


11.84% 


1/96 


49.38% 


13.02% 


11.84% 


2/96 


61.85% 


15.61% 


11.84% 


3/96 


51.36% 


14.99% 


11.84% 


4/96 


59.07% 


13.52% 


11.84% 


5/96 


62.93% 


13.75% 


11.84% 


6/96 


46.26% 


14.11% 


11.84% 
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On Statistical Event-History Analysis and Its Application 

Petr Volf, UTIA Prague and TU Liberec 

Summary: The concept of the counting process is recalled and the new idea of the ’cumulative’ 

process is presented. The counting process describes the sequence of events and is characterized by 
the intensity. By the cumulative process we understand a stochastic process which cumulates random 
increments at random moments. It is described by an intensity (of the underlying counting process) 
and by a distribution of increments. We consider several schemes of these processes and we give an 
example of application to analysis of financial transactions. 

Key words: counting process, cumulative proces, hazard function/intensity, Cox model, mathe- 
matical statistics. 

1. Introduction 

The event-history analysis is a branch of mathematical statistics studying and modelling the random 
sequences of events. Such sequences of events are characterized by the intensity with which the events 
occur. The latest ideas and techniques of event-history analysis are connected with the theory and 
application of counting processes. The progress in this direction was initialized by the development 
of statistical survival analysis. From the seventies, the counting processes approach has been studied 
by many authors (cf. monograph of Andersen et al., 1993), with applications mostly in biostatistics 
and medical research. In the area of social sciences, already the fundamental paper of Heckman and 
Singer (1984) deals with continuous time models (i. e. models of hazard rates). Further progress in 
application of counting processes in the area of econometrics and social studies can be found in the 
“French school” (cf. Gourieroux, 1988). 

The objective of the present paper consists in the presentation of the statistical methodology for 
analysis and systematic modelling (via the hazard-based models) of processes of events (and of 
duration). First, the definition of counting process is recalled. We review the way of application 
of the event-history analysis, including the hazard-regression models (e.g. the Cox’s one). We 
then introduce the process cumulating random increments at random moments (the cumulative 
process) and show its application to analysis of financial time series. We also derive the methods 
of assessing the goodness-of-fit of the model, namely the method based on analysis of generalized 
residuals (inspired by the method of Arjas, 1988). 

2. Counting process and its description 

An n- variate counting process N{i) = {ATi(f), A 2 (t), . . . ,iV„(t)} is a stochastic process having n 
components A,(t), each of them counting the number of observed specified events. So that with 
the help of N(t) we can describe n types of events, or we can follow the occurrence of one type of 
event, for n objects. The time t runs as a rule from 0 to some finite T at which the observation is 
terminated and the data are collected. It could be the calendar time, in other cases it could be a 
relative time shifted individually for each object of the study. It is assumed that Ai(0) = 0 at the 
beginning. A review of theory and applications of counting process models is given, for instance, in 
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Andersen et al. (1993), Fleming and Harrington (1991). 

The moments of events are random. This randomness is modelled with the help of the hazard 
function. If we consider a continuous random variable Y describing the time to the event, its 
probability distribution can be characterized by a density function ip{y) and a distribution function 
F{y). The hazard function of random variable Y is then defined as h{y) = ip{y) / (1 - F{y)) on the 
domain where F(y) < 1. In the more general setting of the counting process, the hazard function 
h{t) is simply a nonnegative, bounded, smooth (e.g. continuous) function defined for t e [0,T]. 
H{t) = /o h{s)ds is the cumulative hazard function. 

Let us now define, for each 7Vi(t), the indicator process Ii(t) in such a way that Ii(t) = 1 if the 
stream of relevant events is observed at moment t, Ii(t) = 0 otherwise - i. e. during the part, of 
[0, T] in which the history of i-th object is censored. In other words, Ni(t) is exposed to the risk of 
the count only if Ii(t) = 1. The function Aj(t) = Ii(t) h(t) is called the intensity, it is the (random) 
function which governs directly the behaviour of Ni(t). Finally, define the cumulative intensity as 
Li{t) = /o Aj(s) ds = Jq Ii(s)dH{s). Trajectories of Li{t) are continuous and nondecreasing, Ni{t) are 
taken as continuous from the right side, meanwhile Ii{i) are left-continuous. 

Further, let us consider a left-continuous nondecreasing sequence of cr-algebras, «S(t), where each S{t) 
is defined on the sample space of {Ni(s), Ii{s), i = 1, ...,n, 0 < s < t}. Actually, <S(t) collects the 
’’history” of the process preceding the moment t. We assume that processes Ni(t) are «S(t)-measurable 
and that (rather formally) P{dNi{t) = 1 1 5(t)) = A»(t) dt. 

Let us now recall the martingale-compensator decomposition of the counting process, namely the 
fact that Ni(t) = Li{t) -h Mj(t), where Mi(t) is a martingale adapted to the cr-algebras S(t) (i.e. it is 
«S(t)-measurable). It holds that E Mi(t) = 0 and variance processes (Ni){t) = {Mi){t) = Li(t), where 
the notation (Ni){t) means var(Ni{t)\S{t)). Similarly as the paths of Ni{t), the paths of Mi(t) are 
also right-continuous. From the rather theoretical (not fully realistic) assumption that there are not 
two events at the same moment it also follows that {Mi,Mj)(t) = 0, for i ^ j, the martingales are 
mutually orthogonal. 

2.1 On application of counting processes 

The applications are mostly connected with regression models. The model of regression means that 
the intensity depends (in a way specified by the model) on some explaining variables, predictors, 
covariates. The values of covariates can again be given by an observed random process depending on 
the time. So that let us consider processes of covariates Xi{t), e = 1, . . . , n, each connected with z-th 
individual and directly influencing the intensity of counting process Ni(t). To construct a model of 
this dependence, assume that there is a (smooth) hazard function h{t,x) from [0,T] x A' to [0, oo), 
where X is the domain of values of all Xj(t). In general, X C , provided K different covariates 
are considered. The intensity process of Ni{t) is then 

= ( 1 ) 

so that it is actually a random process, too. It is seen that, although Xi(t) can generally depend on 
the pre-t history of other parts of the system, in model (1) the intensity depends explicitly only on 
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actual states (just before t) of processes Xi{t). It does not mean any loss of generality, because 
we can incorporate all necessary information about the past into properly chosen covariates. The 
only limitation is the practical feasibility of analysis of such a model. 

In order to make the setting more tractable (from the points of mathematical theory), it is assumed 
that, similarly as processes /»(t), the processes Xi(t) are bounded and have their trajectories con- 
tinuous from the left side. Further, it is assumed that Ii{t) is observable throughout the whole [0,T] 
and that Xj(t) is observed at least when Ii{t) = 1. 

2.1.1 Examples of regression models 

The model of hazard function specifies the form of h{t^x). The parametric model means that this 
dependence is described via a finite number of parameters. The most frequent types of parametric 
distributions are e. g. the exponential, Weibull, lognormal, gamma distribution. For instance, the 
hazard rate of Weibull distribution is hw{t) = where a, /3 are parameters. If a (or /? or 

both) depends on a covariate x, the model expresses the regression of hazard. Again, functions a(x), 
I3{x) can be specified with the help of other parameters, or they can remain in the general form. The 
model containing both nonparametrized and parametrized parts is called the semiparametric model. 
The use of parametric models is much more popular in common statistical praxis (and it is also easier). 
However, the analyst should be aware of the danger of inproper choice of the model. Therefore it 
should be preferred to perform at least the preliminary analysis (for instance a graphical one) on a 
rather general, nonparametrized level. Even the nonparametric models offer a rich choice of structure 
of dependence. 

The idea of separation of common hazard rate from the influence of covariates leads to the multi- 
plicative model, called also the proportional hazard model, 

/i(t, x) = ho(t) • exp(6(x)). 

Function ho{t) is the baseline hazard function, b(x) is the regression (response) function. Both 
functions can be further specified, for instance the additive response function is frequently considered, 
i. e. b(x) = ZjLi If fhe response function is parametrized, we obtain the semiparametric Cox 

model. Its most popular form assumes a linear response function, namely h(t,x) = ho(t) exp(/3'x). 

2.1.2 Examples of application 

Besides in the area of medical research, the Cox model has been used in a number of demographic and 
social studies. Let us mention here several studies analysing the (un) employment data. The event 
of the interest is the change (loss, regain) of the job. The objective is the evaluation and prediction 
of hazard rate (of corresponding event) including the dependence on a set of social, demographic, 
economic covariates. For instance, Han and Hausman (1990) discussed the problem of competing risks 
(the case when several types of events can occur and each type excludes the others). Gamerman 
and West (1987) used the Cox model with time-dependent ’parameter’ /3(t) which described the 
change of influence of covariates during the time. Volf (1993) considered a general nonparametrized 
multiplicative model of hazard rate for analysis of differences (and possible discrimination) between 
different groups of employees. 
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2.2 On methods of model evaluation 

The standard methodology of estimation of (the parts of) hazard rate models is collected elsewhere. 
Mainly in monographs devoted to statistical survival analysis, e.g. in Fleming and Harrington 
(1991), Andersen et al. (1993). The aim is to estimate consistently the unknown parts of the model 
(parameters, response functions, baseline hazard function etc.). The consistency means that with 
increasing amount of information, e. g. with increasing number of observed objects, the estimate 
converges to the “true” model (of course, in an ideal world of mathematical models). The methods 
are mostly based on the maximum likelihood principle. In the case of the Cox model, the partial 
likelihood is maximized (in an iterative way). The maximum likelihood approach is used also in 
quite general cases of nonparametrized models. The method is known as the local scoring procedure 
(Hastie and Tibshirani 1990, Volf 1993) and is based on the sequential local maximization. 

3. Cumulative process 

As we have seen, the counting process is a stochastic point process registering random events and 
counting their number. Let us now consider a random process 

C(t) = lW(s)dN(s), (2) 

where N{t) is now a one-dimensional counting process and Y (t) is a set of random variables. The 
time runs through [0, T] and starting value is again C(0) = 0. The objective is to describe the process 
C{t) with the aid of characteristics of both its components, i. e. the hazard function of N{t) and the 
distribution of V(t). As regards variables Y(t), in the simplest scenario we assume that: 

1. Y{t) are distributed with (unknown) densities 

2. Their means /j,{t) and variances a^{t) are continuous functions on [0,T]. 

3. Each Y{t) is independent of 5(t), i. e. of the history of process C{s) up to t. 

The first task is to derive a compensator of the process C{t). We utilize the decomposition N{t) = 
L(t) -I- M{t) of the counting process. Denote Y*{t) = Y{t) — /i(t). Then we can decompose 

C{t) = Af*(s) + /x(s))diV(s) = f fjL(s)dL{s)-\-M{t), 

Jo Jo 

where 

M(t) = Mi(t) +Ad 2 (t) = tY*{s)dN(s) fi{s)dM{s). 

Jo Jo 

Proposition 1. The processes At(t), Adi(t), M 2 (t) are martingales adapted to cr-algebras S{t) 
on [0,T], the variance process of M{t) is 

{M)(t) = f (ff^(s) + /i*(s)) di(s). 

Jo 

Proof: Evidently, E M{t) = 0. As regards the property defining the martingale, we have for 
0 < s <t that 

£;(><(<) l.S(s)) = M{s) + E y* dM{T)\S{s)^ = M(s), 
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because the same holds for both its parts: For Mi(t) it follows from centering of Y*{t) and from 
its independence of N{t). Properties of M 2 (t) follow directly from properties of M{t). From the 
independence of Y*{i) on the past up to t it also follows that Y*{t) is orthogonal both to dM{t) and 
to dN{t). Therefore 

{Mi)(t) = f a^{s)dL(s), {MuMi){t) - 0. 

Jo 

Further, from martingale properties of M{t) we have that {M 2 ){t) = fo M^(s) dL(s). Then 
(M)(t) = E + M2{tf + 2Mi{t) M2{t) \ S{t~)} 

= [ c7^(s)dL(s)+ f /x^(s)dL(s). 

Jo Jo 

Corollary. Process Jq /x(s) dL(s) is the compensator of process C(t) on [0,T]. 



4. Large sample properties 

In the follow-up, let us imagine that n realizations Ci(t) of a cumulative process C(t) are observed on 
[0,T]. More precisely, we observe moments of events Tij of counting processes ATj(t), corresponding 
indicators Ii{t) and “jumps” Yi{Tij), {i = l,...,n, j = l,...,nj = Ni{T)). Formally, observed 
trajectories are 

Q(t)= iy,(s)dN,(s) = f:Y,{Tij). 

•'0 j=l 

It is assumed that random variables Yi{t),i = 1,2, ...,n have the same distributions, with densities 
f{y\t) and moments and Further, we assume that Yi{t) are independent of the common 

history of the process stored in cr-algebras S(t). Finally, we assume that Ni{t) are governed by 
the same hazard function h{t). Corresponding intensities of Ni(t) are then Aj(t) = The 

likelihood process is 

^ = ft n {Ai(T,,)/(F,(T,,); T,,)} • exp{- T A,(t) dt}. 

i=ij=i •'0 

It is seen that the part containing the intensities and the part containing the distribution of 
T-s are separated (and therefore both characteristics can be estimated independently). Denote by 
E{f) = nr=i Iljii f{yi{Tij)]Tij). Then we obtain the following log-likelihood: 

In £ = \{t) dt} + ln(£(/)). (3) 

In the case of parametrized function /, its parameters can be estimated from the maximum likeli- 
hood estimation procedure based on C(f) only. In a nonparametrized case, estimates of functions 
/i(t), a^{t) can be obtained with the help of a smoothing (kernel) technique. Even the density f{y; t) 
can be then estimated via the kernel method. However, we shall try to characterize the process C{t) 
jointly and to derive some asymptotic properties which depend on h(t), and 



4.1 Estimates and their convergence 



Let us recall here a well-known estimator of function H{t): 

-IL rtdNi(s) 
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where I{s) = I3r=i ^i(^) characterizes the risk set at moment s. Let us assume the following: 

Al. There exists the limit r{s) = limn_^oo ^ in probability such that 

a) the limit is uniform on [0, T], 

b) 1 > r{s) >€ on [0, T], for some e: > 0. 

Then Hn{t) is a consistent estimate of H(t). Further, such an estimate is asymptotically normal 
on [0,T], namely y/ri{Hn{t) - H{t)) = \/nE?=i/o converges weakly to Wiener process with 

variance function /q when n -> oo. It follows from the central limit theorems for martingales 
(e.g. Andersen et al. 1993, chapter II). Moreover, Winter, Foldes and Rejto (1978) proved a variant 
of Glivenko-Cantelli theorem and showed that Hn{t) is a strongly uniformly consistent estimator of 
H{t) on [0,T]. 

Inspired by these results, we consider the average of observed processes 

as an estimator of the function K{t) = /q /h(s) dH(s). Actually, K(t) represents the mean trajectory 
and also the model of the process C(t). 

Proposition 2. Under Al, Cn(t) is a uniformly consistent estimate of K(t) on [0,T], i.e. 
lim„_,ooSupte[o,T] l^n(t) -K(t)l=0in probability. 



Proof: 



^n(t) = E /„ 7^ = 



(4) 



Here Mi(t) are square integrable martingales, with the same distribution as M(t) defined in the 
preceding part. They are mutually orthogonal, (Mi,Mj)(t) = 0 for i ^ j, their variances are 
uniformly bounded on [0,T]. From Lenglart’s inequality (cf. Andersen and Borgan, 1985) it follows 
that for each S,e > 0 and for sufficiently large n > n{S, e) 

P{s\ip < 4 - 

t€[0,T] n e 



This, together with the convergence of ^ assumed in Al, leads to the convergence of 
suP(g(o,T] I /o Efei ^^^1 in probability to zero. 



Proposition 3. Under Al the process y/ri{Cn{t) — K{t)) converges weakly on [0,T] to a Wiener 
process (i.e. the Gaussian process with zero mean and independent increments) which has the 
variance function W{t) = Jq + cr^(s))dH(s). 

Proof: From (4) we have that y/n(Cn(t) - K(t)) = \/n/o E?=i ^ 7 ^- The statement of the 

proposition follows directly from Proposition 1. and from the central limit theorem for martingales. 
The assumptions of continuity and boundedness of functions /x(t),cr^(t), /i(t) enable us to omit any 
additional condition of Lindeberg type. 
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4.2 Testing the goodness-of-fit 

Arjas (1988) and later Volf (1996) derived goodness-of-fit tests for the counting processes model, and 
generalized them for the case of hazard regression models (namely Arjas considered the Cox model, 
Volf a general case of hazard regression model and the Aalen model). From this point of view, the 
case considered here is much simpler, because the regression is not involved. 

Let the model be given by functions i/(t), /^(t), cr^(t), we want to decide whether the data correspond 
to it. The data are represented by the observed trajectories C,(t) and indicators 7i(t), z = 1, . . . ,n. 
The tests are quite naturally based on the comparison of C„(t) with expected K{t). The process of 
differences Cn(t) — K{t) is called the residual process. 

Graphical test: Let us order all moments of events into one nondecreasing sequence Tk, k = 
1, . . . , AT. For a graphical comparison, we plot K(Tk) = /i(s) dH{s) and C(Tk) into one figure, 

against k on the abscissa. If the model fits the residual process is a martingale asymptotically tending 
to zero. Then it is expected that both plots will be close to each other. An opposite case (i.e. an 
increasing distance of both curves) indicates that the model K{t) does not correspond to the data. 
Of course, a more precise test will need a specification of critical limits for the distance of compared 
curves. Such critical bounds can be derived from the large sample properties, for instance in the 
following way. 

Numerical test: Numerical test is based on asymptotic distribution. From the result of Proposi- 
tion 3 it follows that the process 

Dn{t) = ^^(Cn(^) - K{t)) /(I + W{t)) 

is (if the model holds) asymptotically distributed as a Brownian bridge process where 

V{t) = W{t)/{1 -h W{t)), t 6 [0, T]. Hence, a test of Kolmogorov-Smirnov type can be used. From 
the theory of Brownian bridge it follows, for instance, that for any d > 0, 

P (m^Dn{t) >d^=P (min Dn{t) < -d^ ^ exp{-2(f) 

approximately. So that the value exp(-2cP), where d is the observed maxA: \Dn{Tk)\, is an approxi- 
mate p- value for the test of hypothesis of the goodness-of-fit against a proper one-sided alternative. 
Besides the goodness-of-fit tests, we can also consider the tests of homogeneity. They compare two 
(sets of) realizations of the process. A graphical comparison can be based on a slight modification of 
the method described above. On the other hand, the performance of a numerical test of homogeneity 
is limited by the fact that, as a rule, certain characteristics of the joint model have to be estimated. 
The properties of the test procedure then depend strongly on the properties of the estimator. 

5. Example - the fraud detection problem 

As an example, let us consider one-day processes of financial transactions performed via credit cards 
at a gas station, for i = l,...,n = 30 days. We follow both the number of transactions (forming 
the counting process) and also the cumulation of transferred amounts (from this we obtain the 




290 



cumulative process). Figure 1. contains an example of averaged process N(t) = and 

of estimated (and smoothed) corresponding hazard function h(t). Figure 2. shows the averaged 
cumulative process C(t) = Ci(t)j and estimated and smoothed derivative of function K(t). 

Figure 3. displays individual realizations of Ni(t) and Ci(t). Figure shows also the curves of sample 
means and 95% quantiles, respectively. 





Figure 1: Averaged counting process and estimate of its hazard rate 





Figure 2: Averaged cumulative process and estimate of its rate 

The methods of statistical event-history analysis, namely the tests described in preceding part, offer 
the tools for comparison the trajectories of processes and for detection of atypical behaviour of some 
of them. An atypical (outlying) character of the process can be caused by a fraudulent action. 
Therefore, the statistical analysis of counting and cumulative processes is directly applicable to the 
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Figure 3: Realizations of counting and cumulative processes; full lines connect the sample means and 
95% quantiles 



fraud detection problem. 

6. Conclusion 

The main advantage of the counting processes is their dynamics resulting from the conditioning the 
actual intensities by the history of the system. This area of statistical methods has a well developed 
theoretical background as well as the techniques of computational analysis. 

The main purpose of the paper was to review the methodology of statistical event-history analysis 
and to suggest several modifications and new directions of the research. The counting process can 
be naturally generalized to so called marked process (Arjas, 1989), where the marks represent the 
states of objects. We considered here another generalization consisting in the combination of the 
counting process with the process of random increments (the cumulative process). Such models are 
suitable for description of many real-world technological, environmental and also financial processes. 
We derived tools for modelling and statistical analysis of such situations, especially for testing the 
agreement of the data with the model. 

As regards the generalization of the case studied in the present paper, the first one would consider a 
functional model for the hazard function describing also the influence of the history of Ci{t) on the 
actual intensity. For instance, regression models (variants of Cox model, say) are available for such 
a case. Another generalization would omit the assumption about independence of variables Yi{t) on 
the history and would deal with increments generated by a specific random process model. 
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Detection of Compatible Turning-Points and 
Signal-Extraction of Non-Stationary Time Series 

M. Wildi, University of St. Gallon 

Summary: Traditionally, trend-estimation or -extraction methods for non-stationary time series 
(ARIMA-model based, structural models. Census X12,...) make use of a stochastic modeling proce- 
dure which not only determines an optimal symmetric MA(cx))-extraction filter (this is not true for 
Census XI 2) but also supplies missing values at both ends of a finite sample by optimal fore- and 
backcasts, hence minimizing the unconditional final revision variance. In this paper we propose a new 
trend estimation procedure based on a direct filtering approach. We generalize the class of time invari- 
ant filters by including explicit time dependence towards the end of a sample and optimizing in each 
time point a corresponding filter with respect to a conditional final revision variance minimization. 
The condition corresponds to a time delay restriction and this will generalize usual unconditional op- 
timization procedures. It is shown that this optimization underlies an uncertainty-principle (APUP) 
which is best solved by general HR- or ARMA-filters instead of the usual MA-designs. This direct 
IIR-filter-method may be used either for traditional trend extraction or for detection of compatible 
turning-points of a series (to be defined below). In the latter case it is shown that the theoretical 
extraction filter has a transferfunction taking the form of an indicator function. 

1. Introduction 

Many traditional trend-estimation procedures use a stochastic-modeling approach determining the 
optimal extraction filter for doubly infinite (infinite past and future) samples and provide optimal 
fore- and backcasts, which are intended to extend finite samples on both ends. In this paper we shall 
briefly analyze a more direct filtering approach. Section 2 introduces time-invariant systems, MA- and 
ARMA- or IIR(Infinite-Impulse-Response)-filters, section 3.briefly describes frequency-characteristics 
of a filter and introduces an uncertainty principle of linear filters. In section 4. we first define the 
problem of compatible turning-point-detection for doubly infinite samples and extend the results to 
finite samples, resulting in a constrained optimization problem. Finally, section S.extends the direct 
filter approach to traditional trend-estimation procedures and section 6. concludes by some general 
remarks about trend-extraction. 

2. Filters as Linear Time-Invariant-Systems 

A discrete-time system is essentially an algorithm for converting a sequence (a realization of an 
input stochastic process Xt) into another sequence (a realization of an output process It). A linear 
time-invariant-system may be represented by a convolutional relation between the input Xt and the 
output Yt (see Rabiner,Gold (1975)) : 

Yt= t, b,Xt-k 

k=—oo 

We call such a system a (linear and time-invariant) filter. In the following we shall use the so called 
recursive or ARMA or also Infinite- Impulse- Response (IIR)-filter-class which is defined by: 

= i hXt-k 

k=l k=-r 
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It is more general than the traditional (in general symmetric) MA-filter for which p = 0,r = ^, 6^ = 
b-k and it shares the property of parsimony with the class of ARMA-models as invoked by Box and 
Jenkins (1970). In the frequency domain, the above equation may be rewritten as : 



r(o;) = Cexp{—iur) 



YlltloiZk - exp(wt)) 
nLi(^* - exp(iw/t)) 



where r(a;) is called the transfer function and Zk^ Pk are the zeroes and poles of the transfer function 
of the filter (note that MA-filters do have zeroes only). The filter is real if all complex poles and 
zeroes occur in complex conjugate pairs and it is called minimum phase if all zeroes and poles lie 
outside the unit circle. In the latter case it is stable and causal (r = 0), see Oppenheim, Schafer 
(1975). Since in general we are also interested in non-causal filters (r > 0) we shall restrict attention 
to filters which may be made minimum phase by removing r zeroes at 0. 



3. Phase- and Amplitude-Functions : an Uncertainty Principle 



Amplitude- and phase-functions of a filter (bk)kez are defined by A{(jj) := |r(a;)| and $(o;) := 
arg(r(o;)). A{u) is the weight the filter applies to a wave of frequency u (damping or amplifying) 
and (f){u) := ^{u)/u) is the time delay of the filter output for a wave of frequency u. It may be 
shown (see for example Oppenheim, Schafer (1975)) that among all filters with identical amplitude 
functions, the minimum phase filter minimizes the time delay of the filter output Yt. Clearly this is 
a desirable property since in many applications (like in economics) one is interested in the actuality 
of the filter output Yt. 

Often one is interested in designing a filter which meets some intended amplitude and phase function 
characteristics. For the ideal low pass filter with cutoff frequency A the transfer function is defined 
by the Indicator function H{cj) = I{u,<=x] (see below) so that the phase is zero and the amplitude 
becomes discontinuous. Unfortunately both functions are related so that both conditions cannot 
be met simultaneously : for minimum phase filters the complex transferfunction may be completely 
reconstructed from either the amplitude- or (a suitably defined version of) the phase-function, see 
for example Oppenheim, Schafer (1975), section 7.1. Another problem arises from the ambiguity of 
the traditional definition of the phase-function as given by arctan(7m(r(o;))/i2e(r(a;)). A properly 
defined analytic extension for minimum phase filters is to be found in Wildi (1999). 

It is shown from a geometrical argument in Wildi (1999) that to a first order of approximation 
amplitude and time delay of a an AR(1) low pass filter are related by 



4(w) 



« Ku 



where r(a;) = 1/A{u), K is a constant and the derivatives are taken with respect to u. 

Clearly to a first order of approximation the phase function increases linearly with the negative deriva- 
tive of the amplitude-function : so if a filter becomes very sharp (nearly discontinuous amplitude 
function) then the time delay will tend to oo. This will be called the Amplitude-Phase-Uncertainty- 
Principle (APUP) : small time delays imply poor frequency discrimination and conversely. 
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4. Detection of Compatible Turning-Points 

4.1 Ideal Detection- Tool in Doubly Infinite Samples 

We shall first define our approach of turning-point detection for a doubly infinite sample and then 
extend it to finite samples. 

Definition : An ideal procedure should detect all and only these turning-points of a given finite 
sample of a stochastic process which satisfy the following two properties : 

-the mean-duration between two consecutive turning-points is at least A time units, where A is a 
parameter specified a priori (by the user) and 

-the turning-points are compatible with the process-generating-mechanism. 

Definition : A compatible turning-point belongs to a smoothed compatible component Vt of Xt sat- 
isfying the following properties : 

-Vt is monotonic between two consecutive turning-points (which are points of trend reversion) and 
-Vt retains all relevant information (of the sample), contains no irrelevant information (of the sample) 
and does not show any exogenous or spurious information (like Slutzky- effects for example). 

Before we proceed to a solution of the above problem, let us note that the detection of compatible 
turning-points focuses on the long term (low frequency) behaviour as defined by the parameter A 
of a series and as such adresses a prospective (forecasting) perspective at the upper end of a given 
sample. Besides this, the elimination of the high frequency part (generally associated with some 
noise-process) reveals the long term behaviour of a series in a historical perspective. 

It is shown in Wildi (1999) that a compatible smoothed version Vt of mean-duration A of a given 
doubly infinite sample X^t = — oo, ...,oo is uniquely defined by the output of an ideal filter with 
(real) transfer function r(o;) = /{u;<a}, where /{u;<a} = 1 for cj < A and 0 else, is an indicator function. 
Such an ideal low-pass filter may be approximated arbitrarily closely in mean square by an infinite 
symmetric MA-filter (note that although the time delay of the filter response would be infinite, this 
would not matter since the sample is doubly infinite). Regarding the finiteness of a sample, we know 
from section 3. that such an ideal filter may not be approximated arbitrarily closely, since the time 
delay and the amplitude functions are antagonistic. We therefore have to perform some optimization 
procedure. 

4.2 Optimization for Finite Samples 

A straightforward optimization procedure would minimize 

E[{Yt - Yt)^] -> min (1) 

where Yt is defined to be the (unknown) output of the ideal filter. Since the true structure of the 
process underlying the input signal Xt is generally unknown, it is shown in Wildi (1999), that the 
minimization of 




296 



/ |r*(w)exp(iwA:) -/{„<>) p/(w)rfw (2) 

J—ir 

where f{u) is defined to be the periodogram of provides an approximation up to 0(1/N) of 
~ where Yt and Yt are the outputs of the ideal and of the approximating filters 
respectively. For stationary inputs this result follows for example from Priestley (1981). For non- 
stationary inputs it is shown in Wildi (1999), which also proves that smoothed periodogram estimates 
of the (pseudo) spectral density are inefficient because of the underlying non-stationarities. 
Formally, one should compute A; = 1, ..., TV different filters F{k), each beeing optimal for a correspond- 
ing time point iV — A; in the sample. However, most series in practice are sufficiently well behaved so 
that Fk ^ FkQ,k > ko, in mean square and for moderate ko (until yet no practical example implied 
a A:o > 6). Note first that IIR-filters generally outperform MA-designs, because they allow for more 
general frequency characteristics and second that by the above we have defined a direct filter approach 
: instead of fitting a model to the data, we fit an IIR-filter to the ideal transferfunction. The informa- 
tion in the sample is used only as a weighting function of the approximation in the frequency domain. 

^From a prospective perspective, one is often interested in the detection of turning-points near or at 
the end of a finite sample. Whereas 1 and 2 provide the best (mean square) approximation of the 
level of the smoothed version Vt of Xf, the optimization of this criterion unfortunately often implies 
a non-negligible time delay of the filter output Yt near the end t = iV of a series. Therefore, we 
provide a solution by minimizing a conditional variance criterion implying a time delay restriction. 
Consider the identity : 

\fk{Lj) exp{iujk) - Ip = |f/b(o;)p + 1 - 2|ffc(o;)| cos($(a;) - ku) (3) 

By expanding the cosinus up to second order terms, setting 3 into 2 and minimizing with respect to 
a time delay condition, one obtains the minimization of 

^ {A{u)) - lflx{uj)duj + f ($(w) - kujflx{u;)du + A(w)^Ix{w]doj (4) 

J—X J—X JX(—ir) 

subject to 

< 5k (5) 

where Ix{^) is the periodogram of the input signal and <f>{u) := ^{u)/u in 5 is the time delay of the 
filter, see section 3. For various reasons explained in Wildi (1999) we use a penalty-function algorithm 
coupled with a Newton- Raphson optimization procedure to perform the above constrained minimiza- 
tion. Equation 4 reveales the APUP brought into a form intended to minimize the variance between 
ideal and approximated filter outputs. One also sees that although IIR-filters have non-linear phases 
(see for example Rabiner, Gold (1975)), they are intrinsically better suited to optimize the APUP of 
section 2, since the optimization procedure involves both amplitude- and phase-functions. It is readily 
seen, that the above conditional minimization procedure makes fully use of an analytic extension of 
the phase-function. Finally, note that the second order approximation to the cosinus is merely used 
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for accelerating the convergence, since essentially an ambiguous trigonometric function is replaced 
by a convex quadratic function (results show that the convergence is indeed heavily increased, see 
Wildi (1999)). 

Remark : in order to avoid the well known Gibbs Phaenomenon (see for example Priestley (1981)) 
one may replace the discontinuous ideal indicator function by some continuous function with a 
narrow cross band between pass and stop band. Alternatively one may simply occult frequencies 
corresponding to the artificial crossband in the optimization procedure, implying independence of 
the resulting filter approximation of the shape of the transferfunction in the cross band (so that one 
may implicitely assume some optimality of the ideal transferfunction, see Wildi (1999)). 

5. Signal Extraction 

Although the detection of compatible turning-points as defined in section 4.might be an important 
approach to an investigation of the long term behaviour of a series, there exist many other approaches 
for tackling this problem. Well known examples are 

-the ARIMA-approach (see Kitagawa, Gersch (1984), the canonical decomposition in Hillmer, Tiao 
(1992) or the Beveridge-Nelson decomposition (1981)) 

-the ’’state space” approach (see Harvey (1989)) 

-the Census Xli, i=l,2, see Findley (1998). 

Despite methodological discrepancies concerning im- or explicit definitions of the ’’long term be- 
haviour” of a given series implied by these methods, one may be tempted to unify them on a ’’lower 
level” by concentrating on extraction procedures only. In fact, once our motivation of detecting 
compatible turning points is dropped, the choice of the indicator function in section 4.may become 
somewhat arbitrary. Evidently, this choice may be replaced by any theoretically motivated definition 
of some ideal transfer function. For the determination of an optimal approximation, we may then 
use the procedure of section 4. Although the above methods are all optimal under some more or 
less restrictive assumptions concerning the true (unknown) structure of the data generating process, 
some additional gain of information may be expected by using our procedure : 

-First, towards the end of a series, our approach generalizes the unconditional mean square mini- 
mization procedure by including a time delay constraint. 

-Second, some of these methods make more or less strong assumptions about the process generating 
the input signal (integration order for example) as well as about the adequacy of the estimated mod- 
els. Often multi-step ahead forecasts (of quite high orders) of models optimized for one step forecasts 
only are used in supplying the missing input values for some ideal doubly infinite MA extraction filter. 
Whereas such designs may be well suited for some artificial simulation context, their extension to 
practical time series may be questionable. In fact model-mispecification (based on wrong integration 
orders for example) would not only lead to inadequat extraction filters (and hence non-intended long 
term behaviour) supplied by suboptimal forecasts, but would also complexify diagnostic procedures, 
like the computation of the variance of the revision error for example. 

In this context, a direct filter approach approximating some theoretically motivated ideal transfer 
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function may circumvent some of the above difficulties (see Wildi (1999)). Comparisons of our IIR- 
procedure with the above methods are to be found in Wildi (1999). 



6. Conclusion 

The essential methodological difference between traditional signal-extraction methods (as mentioned 
in section 4) and the above direct filtering approach consists in the fact that the former make use 
of an explicit stochastic model of the input process Xt for extracting some model-based signal from 
some model-based nuisance components. Both the detection of compatible turning-points (section 
4) and the signal extraction (section 5) result from a direct filter approximation problem : rather 
than fitting a model to the data of the sample, we fit a filter to some theoretically motivated transfer 
function (which may be for example a traditional extraction- or a turning-point detection filter) 
and the information of the sample is used only for weighting the approximation through the whole 
frequency spectrum. For this, we need only to compute the (non-parametric) periodogram of Xt. 

The choice of the ideal transfer function may be based on theoretical and/or practical reasons. 
We are tempted to classify the detection of compatible turning-points as well as the Census XI 2 
method into the second class, whereas the first one better suites to the model-based procedures. 
It remains to say that tremendous differences between the estimated trends of the various model- 
based methods arise in practice, depending on which assumptions are used (or in other words : 
which model). Whereas some methods rely on a kind of ’’smoothness” criterion (like the canonical 
decomposition, which in some sense arbitrarily choses among the set of consistent solutions the 
unique one for which the (pseudo) spectral density is zero at the frequency tt) or on the flexibility of 
the eventual forecasting function (higher integration orders allow for higher polynomial orders of the 
eventual forecasting functions, see Harvey (1989)) implying more or less smooth trends, others imply 
trends which might well be more erratic than the original series, see for example Beveridge, Nelson 
(1981). Finally, whereas some methods rely on the assumptions of the additive orthogonal component 
model, some assume dependence of the components (like the Beveridge-Nelson procedure). Well 
founded critics against the orthogonal component model, based on economic considerations, are 
given in Hylleberg (1998) and Nerlove, Grether, Carvalho (1979) for example. Without restrictive 
assumptions about the structure of a stochastic process, the trend component can neither be uniquely 
defined nor observed nor extracted from a realization of the process. This fundamental undeterminacy 
is responsible for the variety of trends produced by different methods, for which various underlying 
assumptions fill in (more or less successfully) lack of information. In our view, the detection of 
compatible turning-points, with its appealing prospective content at the end of a series, though 
not motivated from economic theories or model-based approaches, offers a good complement to 
traditional trend extraction methods. From a practical point of view, we think that our method, 
which has been optimized in view of a prospective goal (recall the time delay restriction), might well 
supply traditional methods by some ’’new” (or should we say ’’actual”?) information. 
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Summary: The paper provides an alternative, but direct proof of the known fact that the Shapley 
value of every average-convex transferable utility game is included in the core. The study about the 
membership to the core is extended to the class of efficient, linear and symmetric values for TU-games. 
For such a value, a recursive formula is presented and used as a basic tool to establish the membership 
of the value to the core whenever the game satisfies an extended version of average-convexity. 
Keywords: TU-game, core, value, average-convexity 
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1. Average-convex games 

We aim to prove that, for a particular type of coalitional games called average-convex games, the 
Shapley value belongs to the core. A (cooperative) TU-game (with transferable utility) is a real- 
valued function v : 2^ JR defined on the power set 2^ of a finite set N. An element of N (notation: 
i G N) and a nonempty subset S of N (notation: S C N oi S e 2^ with 5 ^ 0) is called a player and 
coalition respectively, and the associated real number v{S) is called the worth of coalition S. The 
cardinality (size) of coalition S is denoted by |S| or, if no ambiguity is possible, by s. Particularly, 
n denotes the size of the player set N. The worth of coalition S represents the total amount of 
(monetary) profits that the members of S can achieve by cooperation, without the help of the other 
players. By convention, v(0) := 0. 

For the solution part of TU-games (that will be treated in Sections 2-3), it is of great importance 
to have knowledge of possible properties of the game. The first stage of research was devoted to 
the well-known convexity (or super-modularity) notion for TU-games and subsequently, a variety 
of convexity notions has been introduced and investigated. As one motivation among others, the 
so-called average-convexity may be regarded as one possible extension of convexity, if one adopts the 
principle on the average. A game i; : 2^ — ^ IR is said to be average- convex if it holds that 

Y: b(ii) - «(A{i})] < E MP) - ’^(n{i})] for all C P C N. 
ieR i€R 

Average-convex games were introduced, independently from each other, by Sprumont (cf. [5]) and 
Inarra and Usategui (cf. [2]). Average-convex games are called quasi-convex games in Sprumont ’s 
work. In both papers it is shown that the Shapley value of an average-convex game belongs to the 
core (cf. [5], Proposition 5, page 387; cf. [2], Theorem 2, page 16). Unlike our forthcoming alternative 
proof of the inclusion, Sprumont’s proof involves a recursive formula for the Shapley value (in other 
words, the Shapley value of any player in an n-person game is determined recursively with the help of 
the sum of the player’s Shapley value in subgames with n - 1 players; see Section 3). Our alternative 
proof, however, resembles Inarra and Usategui’s proof, but their proof is much more complicated 
and lengthy since their approach involves the function : 2^ x 2^ IR associated with a game 
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V : 2^ -&]R as follows: 

g^(S, T) := v{S U T) - v{S) - v{T) for all 5, T G 2^. 

In words, the expression p^(5,T) represents the relative gain (or loss, if it is negative) whenever two 
coalitions 5, T (that are supposed to be disjoint) merge and form their union. Inarra and Usategui’s 
definition of average-convexity of a game v was based on the principle on the average applied to the 
associated function g'^ (cf. [2], Definition 3, page 15) and is as follows: 

g'’{P\R, /?) > ^ . 5] P\{i}) for all P C P C AT. 

1^1 i^R 

For the sake of simplification of the exposition, in our approach we mainly deal with the “restriction” 
of the function : 2^ x 2^ IR to 2^ x AT, that is we are only interested in the relative gains in the 
case where a single player joins a coalition. To be precise, we focus on the function : 2^ x iV -> IR 
associated with a game u : 2^ — > IR as follows: 

A^(5, i) := v{S U {z}) — v{S) for all 5 6 2^ and all i e N. (1) 

Obviously, the game v is average-convex if and only if the associated function A^ satisfies 

Y, A^(i?\{z}, 0 < E *) for all C P C AT. (2) 

iG-R iGR 

The reason to perform this simplification is threefold: 

(i) All the information concerning the function g^ is deducible from its restriction A^ by means of 
the following recursive formula: 

g^{S, T) = g^{S U {k}, T\{k}) + A^(5, k) - A^{T\{k}, k) for all S,Te2^ and all k e T\S. 

(ii) The function A^ allows us to rediscover the initial game x; : 2^ ^ IR by means of the following 
relationship: 

5 

HS) = ^”({* 1 - * 2 , • • • , ik) whenever 5 = {ii, i 2 , . . . , ij 6 2’^. 

k=l 

(iii) Involving particular notions for TU-games like convexity or average-convexity, it turns out to be 
no problem at all to reformulate these notions for a TU-game v in terms of the associated function A^ 
and vice versa. The reason just being that such notions are defined in terms of incremental returns of 
single players for joining coalitions in the game v and these incremental returns are precisely described 
by the function A^. For instance, the game v is said to be convex if the associated function A^ satisfies 
non-decreasing incremental returns, that is A*'(P\{z},z) < A^(P\{z},z) for alH 6 P C P C N. 

2. Membership of the Shapley value to the core 

The Shapley value {Shi{N,v))i^N ^ IR^ of the game u : 2^ — IR is defined as follows (cf. [4]): 

Shi{N, v) = 7n(s) • 0 for all i G AT, where (3) 

SCAT 
S9t 

(s — 1)! • (n - s)! 
n! 



7n(s) := 



for all natural numbers n, s with 1 < s < n. 



( 4 ) 
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Theorem 2.1 The Shapley value of an average- convex game lies in the core. That is, if the game 
V : 2^ -^JR satisfies condition (2), then its Shapley value {Shi{N,v))i^N satisfies Eigt ^ 

v{T) for allTCN,Ti^ 0. 

Our alternative proof of this theorem is strongly based on an appropriate reformulation of the core 
constraint for a given T C iV, T ^ 0, as a particular property of the 

associated function A". To be precise, the equivalence stated in the next auxiliary Lemma has to be 
considered as a simplified version of a similar part of liiarra and Usategui’s work (cf. [2], Theorem 
1, page 14). Their result states that the Shapley value of the game v belongs to its core if and only 
if the associated function g'^ satisfies ' 7n(s) * 5 '^(*S'\T, 5nT) > 0 for all T C A/^, T ^ 0. 

Here /3t{N) := t for all T C iV and /3r{S) := (n — s)“^ • [|5 fl T| • n - s • t] for all S,T e 2^ with 
5# AT. 

Lemma 2.2 Consider the expressions (1) and (4)- The Shapley value of the game v : 2^ ]R 
belongs to its core if and only if the associated function A" : 2^ x N JR satisfies the following 
condition: 



Y: 7n(s) • E 0 > E 7t(r) • E i) for allTCN,T^ 0. (5) 

ieSnT ieR 

Proof of Lemma 2.2 

Let (Shi{N,v))iQN ^ be the Shapley value of the game u : 2^ IR. Fix the coalition T C N, 
T ^ 0. By (3) and reversing the order of the double sum, it holds 

YShi{N,v) = E L 7-.(s) • A’'(5\{i},i) = E 7«(s) • E 

ieT ieT SCN ieSDT 

S3i ^ - 

Recall that the Shapley value (5/i»(T, v))»eT € IR^ for the subgame u : 2^ -> IR is efficient, i.e., 
= '^(^)* Applying (3) once again with respect to this subgame and reversing the 
order of the double sum, it holds 

v{T) = Y: ShiiT, v) = YE yt{r) • A^{R\{i}. i)= Y 7 t(r) • E 0 

ieT ieT RQT ^^RCT ieR 

RBi ^ - 

Hence, for a given T C AT, T ^ 0, the corresponding core-constraint YlieT Rhi{N,v) > v{T) is 
equivalent to (5). □ 

Proof of Theorem 2.1 

We aim to apply Lemma 2.2 to the average-convex game v. That is, we prove (5), provided that 
(2) holds. Fix the coalition T C AT, T ^ 0. The essential idea is to distinguish among coalitions 
SCN according to their intersection with the given coalition T, that is coalitions having an identical 
intersection with T are treated equally. For that purpose, with any coalition RCT, there is 

associated the collection C 2^ of coalitions as follows: 



TR — {Se 2 ^ \Sr\T = R} that is, Tr:= {S £ 2^ \ S = RC^ P for some P C N\T}. 
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Obviously, with every coalition S C N there is associated a unique coalition RCT so that SoT = R, 
and all coalitions RCT can be generated in this way. Prom this we derive that 

Y: 7 n(s)- E A”(S\{i},t)= E E 7 n(s)- E A’'(S\{i},i) 

ieSnT 9:^RCTSeTR i£SHT 

For the moment, let us also fix the coalition RCT, R^it. Since it holds that R = S nT C S for 
all S eTr, the average-convexity (2) of v yields the following: 



A''(5\{z}, 0 ^ 0 for S eTr. Now we deduce the following: 

iesnT i€R 

Y: 7n(s)- E A’'(S\{f},0 > [EA”(i?\{i}.i)l • [ E 7nW] 

seTn iesnr ^ieR ^ sq.Tr 

The cardinality s of a coalition S &Tr varies from r up to r + n - 1, thus r<s<r + n-t. For any 
size s satisfying r < s < r + n - t, the number of coalitions in Tr with size s is namely one 

has to choose the s - r elements of S\R out of the n - t elements of N\T. At this stage we claim 

r+n-t 

E (rr)-7n(s) = 7f(r)- 

s=r 

The inductive proof of the claim will be postponed and treated below as a separate result. All 
together we arrive, for a given RCT,R^<i,a,t the following chain of (in)equalities: 



E 7nW- E A”(5\{i},f) > [EA''(ie\{O.Ol • [ E 7n(s)l 

iesnT '•»€« •' ^sqTr 

EA*-(ii\{i},f)lTE‘(::;)-7n(®)l 

ieR j L 5=r J 



SQTr 



= 7«W-EA’’(i2\W,0 

t6B 



Summing up over all R CT, R ^ 0, we obtain 



Y: 7nW- E A”(5\{i},i)> E 7«(r)-EA”(fl\W,0 

0^5CiV iesnT 0^RCT ^eR 

Since the last inequality holds for all T C iV, T ^ 0, we conclude from Lemma 2.2 that the Shapley 
value of the average-convex game belongs to the core, provided that we prove the claim above- 
mentioned. This completes the proof of the theorem. ^ 



Lemma 2.3 

n—t 

^ (”ib 0 * + r) = 7 t(r) for all natural numbers r, t, n with 1 <r <t <n. (6) 

k=0 

Proof. 

The proof of (6) proceeds by induction on the size n-t. The case n-t = 0 holds trivially, while the 
case n-t = l holds because of the relationship 7 n(r) + 7 n(r + 1) = 7n-i(^)- From now on suppose 
n-t >2. Let pp{q) := for all natural numbers p, q with 0 < ^ < p. It may be verified that the 
following combinatorial results hold: pn-i{k — 1) + pn-i{k) = Pn(k) for alll<A:<n — las well as 




307 



Ifnik + r) H- 7 „(A; H- r -I- 1) = 7 n-i(A; H- r) for all 0 < A; < n — 1 — t and all 1 < r < Prom this we 
deduce the following: 

^ Pn-t(fc) • 7n(A: + r) 
k=0 

n-t-1 

= 7nW + 7n(n - i + r) + Pn-t{k) ■ Jn{k + r) 

k=l 

n-1— t n-l-t 

= 7n(r) + 7n(n - 1 Pn-i-t{k - 1) • 'fnik + r) + Y, Pn-i-t(k) ■ ■ynik + r) 

k=l k=l 

n-l-t n-l-t 

= Y. Pn-i-t(k) • Jn{k + r + 1 ) + Y Pn-i-t{k) ■ y„(k + r) 
k=0 k=0 

= Y Pn-i-t(k) ■ l7n(k + r + 1) + y„(k + r)] 
k=0 
n-l-t 

= p„-i-t(k) ■ y„-i(k + r) 

k=0 

= 7t(r) 

where the last equality is due to the induction hypothesis applied to n — 1 — t (with respect to n — 1 
instead of n). This completes the inductive proof of (6). □ 

3. A recursive formula for an efficient, linear, and symmetric value 

So far no other value than the Shapley value was considered. In this section we pay attention to a 
particular class of values that are, more or less, related to the Shapley value. Formally, a value $ 
on the universal game space S associates a single payoff vector ^(N,v) = (^i(N, v))i^jv G IR^ with 
every TU-game {N,v). The so-called value u) of player i in the game {N^v) represents an 
assessment by i of his gains from participating in the game. For reasons that will be explained by 
the next Lemma, we are interested in efficient, linear, and symmetric values. Our aim is to present 
a recursive formula for such values. To start with, we need to recall these three classical axioms for 
an arbitrary value ^ on Q. 

Efficiency axiom. = v{N) for all games (N,v). 

ieN 

Linearity axiom. ^(iV, av -f /3w) = a • ^(iV, v)-{- 0- $(iV, w) for all games (iV, u), (TV, w), and 
all a € IR, /? € IR. Here the game (AT, av H- /3w) is defined by {av -f- /3w){S) := a • v{S) -h /? • w{S) 
for all S e 2^. 

Symmetry axiom. $j(iV, u) = ^^^i){NjTTv) for all games (iV, u), all z G TV, and every permu- 
tation 7T of TV. Here the game (TV,7 tu) is defined by {irv){nS) := v{S) for all S G 2^. 

We recall the following characterization of efficient, linear, and symmetric values. To advance a cause 
we present here a slightly rewritten, but equivalent formula taken from Remark 2.6 in [1]. For the 
sake of completeness, we mention that an independent proof of a very similar statement (in which 
symmetry is replaced by equal treatment property for substitutes) can be found in [3], Lemma 13, 
page 13, resulting in a formula identical to the one presented in Lemma 2.5 in [1]. 
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Lemma 3.1 (cf, [1], Lemma 2.5 and Remark 2.6, page 5) 

A value ^ on Q is efficient, linear and symmetric if and only if there exists, for any number of players 
n, n>l, a sequence of real numbers { 6 ]t}jb=o,i, 2 ,...,n 6 ” := 1 , such that, for every n-person game 

(N,v) with n>2, the value payoff vector ^(N,v) = {^i{N,v))i£N is of the following form: 



MN, v)=Yi j„{s) ■ • v(S) - • «(5\{i})l for all i e N. 

Sir fj ^ 



(7) 



In words, the value $j(iV,u) of player zin a game {N,v), as given by the right hand of (7), may be 
interpreted as player z’s Shapley value in a slightly adapted game. For every coalition, its worth in 
the adapted game equals a given fraction of its worth in the initial game, where the fraction depends 
upon the cardinality of the coalition (assuming symmetry with respect to the cardinality of the player 
set). To be exact, by (3) and (7), it holds that ^{N,v) = Sh(N,v*) where u* : 2^ -> IR is defined to 
be v*{S) := 6 ” • v{S) for all S 6 2 ^. 

Now we are in a position to extend Theorem 2.1 to the class of efficient, linear and symmetric values, 
on the understanding that the essential notions (like core and average-convexity) are slightly adapted 
with respect to the sequences of real numbers that describe such a value. 



Corollary 3.2 Let ^ be an efficient, linear and symmetric value on Q, as given by (7), with the 
corresponding sequences of real numbers ^ satisfying 6 ” := 1 . Suppose the 

n-person game {N, v) satisfies the B- average- convexity condition, that is 



E 



v{R)-b’:_, 



^(^{0)1 < E k • <p) - b;-i ■ 

ieiz '■ ■' 



for all RCPCN. 



Then its value {^i{N,v))i^N lies in the B-core, i.e., ^ allT C N, T ^ 0. 



Proof. Since the adapted game {N,v*) is supposed to be average-convex, it follows immediately 
from Theorem 2.1 applied to v* that 

Y, ^t(iV, v) = J2 > ^*(7^) = • v{T) for a\lTCN,T^ 0. 



Next we introduce and establish a recursive formula for every efficient, linear and symmetric value. 
In words, the payoff to player i of some coalition T in an n-person game {N,v), as given 

at the right hand of the forthcoming expression (9), is obtained from averaging over similar payoffs 
to player i arising from coalitions of which the cardinality is one less, and the adapted incremental 
return of player i with respect to the initial coalition T. 



Theorem 3.3 For every n-person game {N, v) and every coalition T C AT, T ^ 0, let the “semi- 
value” ^(T,u,n) = ($i(T,t;,n))ieT ^ IR^ of the subgame {T,v) be defined as follows: 



MT,v,n):=Y:'yti^)- K-v{S)-bU 



SCT 

SBi 




for all i £T, 



( 8 ) 
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where, for any number of players n > 1 , { 6 J}A;=o,i, 2 ,...,n ^ sequence of real numbers with 6 ^ := 1 . 

For an n-person game (AT, v), define the allocation scheme v) = v))tcn, where v) = 

{'ipf {N, v))i^T € IR^ for all T C N, recursively as follows: for allT C N, T ^ 0, and alii eT 

tT{N,v)-.= ^-k-v{T)-bU-v(T\{i})+ ^ (9) 

^ L SCT,a-t-l J 

S3* 

with the convention that xpf{N,v) := 0 for all i € N. Then it holds that %pJ{N,v) = ^i[T,v,n) for 
every n-person game {N, v), all T C N, T ^ 0, and all i eT. 

Proof. 

Fix an n-person game (N,v). The proof proceeds by induction on the size 1 < t < n. The case 
t = 1, say T = {i}, holds because, by (8) and (9), ipj^^N^v) = 6” • u({z}) = ^i({z},i;,n). From 
now on let T C iV with t > 2, and let i € T. By the induction hypothesis applied to coalitions of 
cardinality t - 1, it holds that ipfiN, v) = ^i(S, v, n) for all 5 C T satisfying s = t ~1 and z G 5 as 
well. From this, together with (8), we derive 

X! = X ^i{S,v,n) 

SCT,a=t-l SCT,a=t-l 

S3* S3* 

= X E T»(r) • k • v{R) - 6"_1 • v(i?\{i})l 

SCT,a=t-l RCS J 

S3* H3* 

= X (* - p) • 7t-i(p) • k • v{P) - 6p-i • ■y(P’\{0)l ■ 

PCT J 

P3* 

The determination of the latter double sum is as follows. Given the coalition P cT with i G P, the 
number of coalitions 5 with cardinality t — 1 and satisfying F C 5 C T is exactly t — p. From this, 
together with (9), we obtain 

^ L SCT,a=t-l J 

S3* 

= i . k • v{T) - bU ■ t'(T\{*})l + 7 • X (< - P) • 7t-i(p) • k • v{P) - b;_, • i;(F\{i})' 

CL J C pqt ^ 

P3* 

= X 7t(p) • k • v{P) - 6^_1 • t;(p\{i})l . 

PCT L J 

P3i 

^From this, together with ( 8 ), we conclude that xj)J{N,v) = ^i(T,v,n) for every n-person game 
(N, v), all T C N,T ^ 0, and all i eT. This completes the inductive proof of the theorem. □ 

Corollary 3.4 Let $ be an efficient, linear and symmetric value on Q, as given by (7), with the 
corresponding sequences of real numbers { 6 Z}jk=o,i, 2 ,...,n? ^ > 1; satisfying 6 ” := 1 . Then it holds, for 
every game (AT, v) and all i £ N, that 



v) = 'ip^(N,v) where 'iIj^{N,v) is determined recursively by (9). 
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The special case 6J := 1 is in accordance with the recursive formula for the Shapley value (cf. [5], 
Lemma, page 392). In order to illustrate the impact of these recursive formulae, we use them as basic 
tools to present an alternative proof of our main result. This alternative proof reduces to Sprumont’s 
proof in the context of the Shapley value. 

Alternative Proof of Corollary 3.2 based on Theorem 3.3 

Let ^ be an efficient, linear and symmetric value on as given by (7), with the corresponding 
sequences of real numbers ^ > 1, satisfying 6” := 1. Suppose the n-person game 

(N,v) satisfies the 5-average-convexity condition. The proof of the core-membership proceeds by 
induction on the size of the player set n, n > 1. The case n = 1, say N = {z}, holds because, 
by (7), $i({z},u) = b\ • v({i})- From now on let n > 2, and let T C TV, T ^ 0. We show that 
^ * ^(^)- By Corollary 3.4 and (9), it holds that 

v) = v) = i • [&;; • v{N) - ■ v{N\{i}) + X) «)] for alUeN. 

On the one hand, the 5-average-convexity of v implies 

E ft;; • v{N) - 6;i_i • v(N\{i})] > E k ■ ^(t) - bU • «(n{*})l 

i€T '■ ■' »€T '■ ■' 

On the other, by some combinatorial computations and Theorem 3.3, we obtain 

ieT keN\{i} 

= E E 

teTjeT\{t} teN\TjeT 

= E E ^j(N\{e},v,n) + Y, 

ter j€T\{t} eeN\T jer 

> Ebh-v{T\{e})+ Y b7-v{T) 

ieT teN\T 

where the last inequality is due to the induction hypothesis applied to the subgames {iV\{^},u}, 
£ £ N, the (n — l)-person subgames all of them inherit the 5-average-convexity property from the 
initial game v. Finally we conclude 

Y^i(N,v) > + E 

i€T ^ ieT ^ j ri i^TkeN\{{i 

> + E b7-v{T) 

^ ieT ^ J n 

= 6” • v{T) for all r C TV, T ^ 0. 

This completes the inductive, but alternative proof of Corollary 3.2. □ 
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A Cooperative Approach to Multiagent Incentive Problems 

M. Krapp, University of Augsburg 

Summary: Most analyses of the principal-agent problem in multiagent settings discuss incentive 

schemes which implement noncooperative Nash equilibria in the subgames played by the agents. In 
contrast to these approaches the focus of this paper is on the question whether the principal can 
benefit from allowing cooperation. To do so, two incentive contracts are discussed — one including 
a cooperation clause and another without such a clause. The comparison shows that no contract 
dominates the other one. The principal’s decision rather depends on the anticipated behaviour of the 
agents as well as on the properties of the production functions involved. Finally, a simple example 
is given to illustrate some of the results. 

1. Introduction 

This paper deals with the problems arising when one party, called principal, delegates decision making 
to two or more other parties designated as agents. We assume the agents to take hidden actions 
in correlated environments. One can regard these actions as levels of effort. Since effort causes 
disutility to the agents, the agents’ effort decisions affect both the principal’s and the agents’ welfare 
levels. Therefore, the principal agrees to compensate the agents for choosing the efforts desired by 
him. Unfortunately, he cannot observe the agents’ eflfort decisions but only the stochastic outputs. 
From this asymmetry of information moral hazard arises: the agents decide with regard to their own 
welfare rather than to the principal’s. 

In presence of moral hazard the principal’s problem is to design an appropriate contract inducing — 
rather than prescribing — his preferred effort levels. More technically speaking, the principal has 
to choose a contract which maximizes his residual welfare subject to several constraints describing 
the agents’ anticipated reactions to this contract. In the case of one single agent the properties 
of optimal incentive contracts are well known (cf for example Grossman/Hart (1983), Harris/Raviv 
(1979), Holmstrom (1979), and Ross (1973)): they settle payment schemes which condition the 
agent’s compensation on the produced output. 

In multiple agent settings like the one discussed here the principal has to consider some additional 
aspects. Because the agents act in correlated environments each agent’s individual output may 
also provide information on the other agents’ decisions. One major question is then whether or 
not each agent’s compensation should depend on the outputs of the other agents as well. Most of 
the theoretical work on multiple agent models deal with this question. Although a wide range of 
different models can be found, they have a common feature: the implicit or explicit assumption that 
the optimal incentive scheme implements a noncooperative Nash equilibrium between the agents 
with regard to their effort decisions (cf for example Borland (1992), Demski/Sappington (1984), 
Holmstrom (1982), Ma (1988), and Mookherjee (1984)). We term payment schemes of that kind 
“noncooperative contracts” . 

In contrast to these analyses our focus is on the question whether or not the principal can benefit from 
allowing cooperation. Such a permission implies a cooperative equilibrium in the subgame played 
by the agents. The comparison with noncooperative contracts requires some insight into the specific 
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features of what we call “cooperative contracts” . For that reason we propose a simple multiagent 
model and use it to study noncooperative as well as cooperative incentive schemes (cf Section 2). The 
next step is to compare the two contract designs from the principal’s point of view. This comparison 
shows that no contract dominates the other one. In many settings cooperative contracts are indeed 
superior to noncooperative contracts. Unfortunately, it is difficult to characterize such settings in 
general. We thus consider in Section 3 the case of two agents and derive explicit criteria for the 
superiority of cooperative contracts. 

2. The Model 

2.1 Assumptions 

There are n risk averse agents denoted l,...,n and one risk neutral principal. To keep things 
simple, we take the basic assumptions from Spremann’s (1987) single-agent Linear-Exponential- 
Normal model (LEN model) and formulate them for the multiple-agent case. 

(L) Let Xi denote the level of effort supplied by agent i and Si the random variable describing the 
environmental risk agent i (i = 1 . . . , n) is confronted with. His output yi is then given by the 
linear production function 

n 

Vi — Xj Si (i = 1, . . . , Tl). (1) 

i=i 

The coefficients Uij {i,j = 1, . . . , n) describe how the output of agent i depends on the effort of 
agent j. Teamwork for instance can be expressed by Uij > 0, sabotage by Uij < 0 and different 
levels of productivity by ua ^ ujjj (in each case j). Using the Matrix := [uij]nn and the 
vectors x := [xi]„, e := [ei]n and y := [yi]„ we can write 

y = fix -h e. (2) 

Another class of linear functions is given by the compensation functions 

n 

Pi = ri + Y,^nVi = (3) 

i=i 

where pi is the compensation agent i (i = l,...,n) receives from the principal. In order to 
simplify notation we define the matrix S := [sij]nn and the vector r := [rj]„ to collect the 
compensation functions’ coefficients. The remuneration vector p := [pi]„ is then 

p = r -h Sy. (4) 

(E) We assume agent i to have a globally concave exponential von Neumann- Morgenstern utility 
function which exhibits constant absolute risk aversion a, (i = 1, . . . , n). In order to keep the 
model simple we assume all agents to behave identical with respect to their risk aversion, ie 
ai = a > 0 (z = 1, . . . , n). 
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(N) The random vector e containing the individual random variables is distributed according to a 
multivariate normal distribution with mean vector 0 (an n x 1 vector whose elements are all 
zero) and positive definite covariance matrix E (of order nx n). 

Because the principal is assumed to be risk neutral, his goal is to maximize the expected value of his 
residual benefit, = E [i'(y — p)], where i is an n x 1 vector containing a column of Is. Furthermore, 
let I be the n x n identity matrix. We now can substitute the productions functions (2) and the 
compensation functions (4) into to get 

= i'[(I - S)flx - r]. (5) 

Because of the given information asymmetry, each agent tends to maximize his own objective function 
Ui (i = l,...,n) which shall be the certainty equivalent of compensation minus the disutility of 
effort. The latter term expresses the personal cost incurred by the agent when choosing a certain 
effort. We assume agent i’s disutility of effort to be Xi^. Assumptions (L) and (N) imply that 
each agent’s remuneration is an affin-linear transformation of normally distributed random variables. 
Therefore, the remuneration’s certainty equivalent with regard to the exponential utility function 
can be expressed as the expected value minus the variance times half the risk aversion (cf Freund 
1956): Ui = E (pi) - | Var (p<) (i = 1, . . . , n). 

We now proceed to obtain a vector representation of all agents’ objective functions, u := [uj]n. First, 
we arrange all disutilities of effort in an n x 1 vector := [x{^]n- Because agent i’s (z = 1, . . . , n) 
compensation variance is equivalent to the ii-th element of the matrix SES', we need a vector 
containing the main diagonal of this matrix. Such a vector can be obtained by making use of the dg 
operator (cf Magnus/Neudecker (1988)): dg(SES') is a diagonal matrix with the same main diagonal 
as SES'. Now we can write 

u = r + Sflx - x^ - ^ dg(SES') i. (6) 

Agent i is assumed to accept an offered incentive contract only if he can attain at least his reservation 
welfare, rrij (i = 1, . . . , n). Let m := [mjjn be the n x 1 vector collecting all reservation levels. Then 
the principal has to consider the inequality 

u > m (7) 

which we term reservation constraint. An easy to prove feature of the LEN model is that optimal 
incentive contracts exactly keep all agents at their reservation levels, ie u = m. Furthermore, the 
principal has to take into account the way agents respond to an accepted incentive scheme: they 
choose their efforts to maximize U{ (z = 1, . . . , n). We shall denote the n x 1 vector of responses by 
X. The equation 

X = X (8) 

describing these responses is called incentive constraint. How x is computed depends on the type of 
contract (cooperative or noncooperative) and is thus considered afterwards in separate sections. 
Before stating the principal’s optimization problem we finally assume that the technology matrix 
Q, the covariance matrix E as well as all agents’ objective functions u and reservation levels m are 
publicly observable — and therefore known to all parties. 
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2.2 Noncooperative Contracts 

In this section we analyse noncooperative incentive contracts. Such contracts suppose the vector of 
responses, x, to implement a noncooperative Nash equilibrium between the agents. That is, each 
agent regards the other agents’ effort decisions as given and hence maximizes his objective function, 
Ui, with respect only to his own effort decision, Xj. Since Ui is a globally concave function of x*, 
the first-order condition du^/dx* = 0 is necessary and sufficient for a maximum. As duj/dxi is 

n 

Sij Uji — 2 Xi for any i = 1, . . . , n (by equation (6)), we get 
i=i 

*=^dg(Sfi)i. (9) 

Now consider the principal’s problem. He chooses the coefficients of the compensation functions, 
r and S, to maximize the expected net benefit, subject to the reservation constraint u = m 
and the incentive constraint x = x. The method of Lagrange multipliers provides a straightforward 
solution to this problem. First, we substitute the response x for x in the objective function, ^n- 
Then we define an n x 1 vector of Lagrange multipliers. A, which we use to incorporate the reservation 
constraint. The resulting Lagrangian formula is 

L(r,S,A) = $n + A'(u-m) 

= i'[(I - S)fix - r] + A' [r + Sfix “ | dg(SES') i - mj . (10) 

A considerable simplification can be achieved by exploiting the first-order condition dL/dr = 0. 
Because dL/dr is — i+ A, we know A = i. If we insert this into (10), we get the simplified Lagrangian 
formula 

L+(S) = i' [fix - X* - I dg(SES') i - m] . (11) 

Note that r has completely disappeared. This observation indicates the minor importance of the 
constant compensation coefficients and proves 

Proposition 1 The principal maximizes his objective function with respect only to the risk sharing 
parameters; the constant compensation coefficients are irrelevant. 

To get the optimal solution, we first insert x as given by (9) into (11) and then set the gradient of 
the resulting Lagrangian formula L'^(S | x = |dg(SQ)i) equal to 0. Since is a function of a 
matrix, some matrix calculus is needed to compute the gradient. Omitting details we obtain 

S* = [ 2 al + dg(n'E-‘n)]-Mg(ii'n) n'E-\ ( 12 ) 

where S* denotes the principal’s optimal choice of the risk sharing parameters S. Because we assume 
E to be positive definite as well as a > 0 the existence of an unique global maximum is guaranteed. 
The proofs of these results are rather tedious and therefore left out. They can be obtained from the 
author. Notice a remarkable feature of S*: it is diagonal if and only if Q and E satisfy 

Q = EA, (13) 

where A is any diagonal matrix. An optimal noncooperative incentive contract then compensates 
each agent on the basis of his individual output alone. 
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Proposition 2 Independent compensation can he optimal even if there are technological and stochas- 
tical dependences. 

Finally, we shall compute the principal’s optimal expected net benefit, ie the value of his objective 
function when choosing a noncooperative incentive contract characterized by S*. This value is given 
by L'*'(S* I X = I dg(S*Q) i). In particular, 

^ i'ft dg(S*n) i - i'm. (14) 

2.3 Cooperative Contracts 

We now analyse the case of incentive contracts including a “cooperation clause” , ie implementing a 
cooperative equilibrium in the subgame played by the agents. In other words: the principal permits 
the agents to coordinate their effort decisions. By doing so the community of all agents acts as 

a virtual decision maker who determines each agent’s individual effort and shares the sum of all 

compensations. We shall term such a group a syndicate (cf Wilson 1968). This fiction simplifies 
the principal’s problem drastically. He is confronted to merely one decision maker — namely the 
syndicate. And since the principal is indifferent to the source of his gross expected benefit, it is 
sufficient to base the syndicate’s compensation on the sum of outputs. We can therefore use the 
simple remuneration function 

p = r-f-si'y. (15) 

This equation describes the sum paid to the syndicate as a linear function of the sum of all outputs. 
The syndicate’s sharing rules — perhaps the result of a bargaining game — are of no interest to the 
principal and remain therefore unconsidered. As in the preceding section, the principal maximizes 
the expected value of his residual benefit which is now 



= E(i'y - p) = (1 - s) i'ilx - r. (16) 

Because, by assumption, all agents are identical with respect to their von Neumann-Morgenstern 
utility functions, it is feasible to simplify matters by supposing one specific agent to make all effort 
decisions. Hence, the syndicate’s objective function is 

u = r -\- s i'flx — x'x — i'Ei, (17) 

where x'x is the sum of disutilites of effort and i'Ei the variance of the remuneration given by 
(15). Since the sum of remunerations is paid to the syndicate, this sum must at least equal the sum 
of all reservation levels. Therefore, the new reservation constraint is 

u > i'm (18) 

or, taking into account the above mentioned feature of the LEN model, u = i'm. When computing 
the response vector, x, which characterizes the incentive constraint, we must recognize that the 
agents now play a cooperative game. The syndicate consequently maximizes the objective function u 
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with respect to all efforts. Hence, the first-order (necessary and sufficient) conditions for a maximum 
are 

^ = s fi'i - 2 X = 0 

Again, the principal’s problem is to maximize his objective function, with respect to the re- 
muneration parameters, r and s, subject to the reservation constraint u = i'm and the incentive 
constraint x = x. After substituting x = x into we use the (scalar) Lagrange multiplier A to 
formulate the Lagrangian function 



L(r, s. A) = + A (u - i'm) 

= (1 — s) i'^^x — r H- A + s i'ftx - x'x ~ ^ - i'm^ . (20) 

In order to simplify this formula, we consider the first-order condition dL/dr = 0. The partial 
derivative of L with respect to r is clearly — 1 + A. Hence, A = 1. Inserting this into (20) gives 



L'^{s) = i'Qx — x'x ■ 



— i'Ei — i'm. 
2 



(21) 



The constant compensation coefficient, r, vanishes as stated in proposition 1. Maximization of 
L+(5 I X = IfiQ'i) now produces the unique solution 



i'fifi'i + 2ai'Ei 



( 22 ) 



From the assumptions that E is positive definite and a > 0, we know 2 a i'Ei > 0. Furthermore, 
i'Qft'i > 0, too. This establishes the existence of an optimal cooperative incentive contract. Addi- 
tionally, we know s* € (0, 1), ie 



Proposition 3 Cooperative contracts assign a real positive share of the output sum to both parties — 
the syndicate as well as the principal. 



The associated value of the principal’s objective function when choosing the cooperative contract 
variant is L"‘’(s* | x = ^sQ'i) and can be written 

= is* i'nn'i - i'm. (23) 

3. Comparing the Contracts 

An important issue is what type of contract — cooperative or noncooperative — the principal should 
prefer. First, notice that cooperative incentive contracts are in a sense collusion resistant: Most 
noncooperative compensation schemes cause great damage to the principal when the agents brake 
the contract by coordinating their actions. Since cooperative contracts explicitly allow the agents to 
arrange their effort decisions, collusion in that sense is impossible. So, 



Proposition 4 If the agents evidently strive to collude, cooperative contract variants are superior to 
noncooperative ones. 
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Otherwise, the principal’s contract decision is not as clear. Since his goal is to maximize the expected 
residual benefit, he may compare the two attainable values, and If the principal 

should prefer the cooperative contract and vice versa. Formulae (14) and (23) yield 

^cc > ^nc ^ j/QQ/j > dg(S*fi) i. (24) 



Unfortunately, we cannot state a more sophisticated criterion. Indeed, the principal’s decision 
strongly depends on the specific properties of the covariance matrix, E, as well as the technology 
matrix, ft. 



Example 1 In order to get some insight into what conditions favour the use of cooperative contract 
designs, we now consider a very simple two-agent setting characterized by the matrices 



b 

II 


’ 1 


P 


and 


ft = 


’la;' 




. P 


1 






u 1 



(25) 



We shall assume p,a; € (— 1, 1) and term p ^‘correlation parameter” as well as u “production param- 
eter”. If eg LJ > 0, each agenVs effort positively influences the colleague's output. The principal’s 



welfare levels associated with the two contract variants are now 

1 (l + o;)" 



^ 5 " = 






2(1+ uY + 2aa^ (1 + p) 

1 (1 + [1 + ^ (^ ~ 2 p)] 



— 7721—7712 



and 



21 + c*;(a; — 2p) + 2aa^ (1 — p^) 



— 7721 — 7722 



(26) 

(27) 



respectively. It follows immediately that 






4=» p< 



2 u 
1 + 



(28) 



So, if p £ (— 1, 2o;/[l +o;^]) for any u; G (—1, 1), the principal should prefer the cooperative contract. 
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Figure 1: Region in (a;,p) space where the cooperative contract is superior. 

Figure 1 illustrates these settings in (u,p) space. Especially, if there is teamwork in negatively 
correlated environments (ie w > 0 and p <0), the cooperative incentive scheme is clearly the better 
alternative. On the other hand, sabotage under positive correlation (u < 0, p > 0) indicates the 
superiority of the noncooperative variant. Finally, if u and p have the same sign, a more careful 
investigation is necessary. 
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Referring to the preceding example, we finally assert 

Proposition 5 The application of cooperative incentive schemes can be beneficial to the principal 
even if the agents do not plan collusion. 

4. Concluding Remarks 

We have explored a number of issues in connection with the use of noncooperative and cooperative 
incentive contracts. One main result is that no contract variant dominates the other one. In many 
settings cooperative contracts are indeed superior to noncooperative contracts. — Especially, if the 
principal anticipates collusion. Of course, much work remains to be done. More sophisticated 
criteria for the superiority of alternative contract designs may be required. Another open question is 
to what extent the findings generalize if we apply cooperative equilibrium concepts to more complex 
frameworks. Finally, the applicability to real life problems has to be analysed. 
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Reputation under Imperfect Observability 
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Summary: The paper uses a two-person, finite horizon game representing a cooperation project. 
An ex-ante agreement stipulates each player’s period input level, the corresponding payoff vector 
strictly dominates the Nash equilibrium. Players cannot observe their partner’s input (defection 
from the agreement is therefore not revealing), but form an adaptive expectation as to the partner’s 
input. The paper investigates when the reputation effect will be strong enough to make the ex- ante 
agreement self-enforcing. 

1. Introduction 

Reputation is an essential element to overcome a prisoners’ dilemma by establishing and maintaining 
cooperative behaviour. The reputation game introduced by Kreps et al. (1982) showed that players 
caring for their reputation will not try to exploit short-term advantages, provided they are not sure 
whether they are playing with an ’’altruist” (i.e., somebody who adheres as long as his partner 
adheres). Kreps and Wilson (1982) introduced a similar analysis for the chainstore game where 
the entrant is not sure about the monopolist’s type. Those games operate under some decisive 
assumptions: (i) defection from cooperation (accommodating an entrant) is revealing, observability 
is perfect (ii) there is a common prior about playing with an altruist, (iii) partial defection is not 
modelled. This paper incorporates those elements investigating the effects of the players’ care for 
their reputation on their adherence to agreed cooperation terms. 

Consider a cooperation project represented by a two-player game (players i, j who are exchangeable) 
with discrete time intervals t; play ends in period T. Also assume that in each period, players 
contribute one homogeneous, arbitrarily divisible input a\ 6 into the joint project (the player’s 
performance); choice of this input level constitutes the players’ actions which he has to chose at the 
beginning of each period. Once chosen, a\ is unalterable in t. Each player’s period payoff ttJ is then 
given by 

7t{ = aj + s*aja^ - af + rf (1) 

with the best-response function /?* : a**(ut) = 5* G and remains constant over the entire 

game, r/* indicates the amount of extra utility i generates from mutually cooperative behaviour. 
T)^ = 0 for most of the players in the population, but there are some players for whom 77 * is an 
arbitrarily high amount of extra payoff exceeding any possible short-term gain achieved by defection 
from the agreement, s* is commonly known, all other parameters in ( 1 ) are private knowledge of the 
respective player. Since 

da\dal 

the game is supermodular, therefore, the action set need not be compact and convex to ensure 
existence of a Nash equilibrium (see Topkis 1979 and for a more recent application Milgrom and 
Roberts 1990). Equating the two best-response functions the Nash equilibrium a*j^ = 
generating payoffs can be found as 
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a 



i* 

N — 



2 + 5 * 

4 — s*s^ 



and = 



2 + 5 ^' 

4 - s»s^‘ 



Assumption 1: s*s-^ < 4. 

Otherwise there will be no positive Nash equilibrium actions and the transaction will deadlock. Let 
us assume that, before play started, both players agreed upon some action vector a = (d*,a^) as 
per-period input. Further assume that a strictly pareto-dominates the Nash equilibrium - it should 
therefore be in the mutual interest of both players to maintain the cooperative action. If both players 
are mutually aware that they are no altruists, play of the single period Nash equilibrium in all t is 
the Nash equilibrium of the entire game. If players are uncertain as to their type but are able to 
observe each other perfectly, a reputation game like the one analyzed by Kreps et al. (1982) emerges. 

2. Imperfect Observability 

In this game, however, players are not only uncertain as to their fellow-player’s altruism, but they 
are also assumed not to be able to directly observe their partner’s actions. Therefore, they have to 
look for some indirect way to conclude their partner’s adherence to stipulated performance. Since (1) 
would indicate that the players’ payoffs are deterministic depending only on the parameters and 
period performance, each player - knowing his own period input - could easily conclude his partner’s 
input. This, however, would be a rather unrealistic modelling. If the jointly produced output is 
sold on a market, fluctuations in demand and prices may distort payoffs. Let us therefore assume 
that input costs (af ) and proceeds (a{ + s'a\al) accrue at the end of each t with players correctly 
anticipating their input costs, but real proceeds being stochastically distorted: = aj + s^a\al + et. 

Let 6t be continuously distributed, unbiased and serially uncorrelated to all other model parameters. 
Of course, players could be assumed to form a joint probability distribution /i*(!EJ|aJ, aj) which enables 
i to construct a confidence interval for j’s action knowing his own E\ and a\. This would resemble 
the collusive oligopoly model of Green and Porter (1984) and Abreu et al. (1986) where a group 
of oligopolists conclude a collusive agreement on output to keep industry prices high, but cannot 
observe each other’s actual output. Instead, they possess a density for industry prices depending on 
output enabling them to construct a confidence interval for the fact that all partners adhered to the 
collusive agreement given industry prices. If they decide that the agreement must have been violated, 
they produce (high) Cournot output for some periods before returning to the collusive output regime 
again. Such punishment, however, does not seem very efficient: (i) in a group of more than two 
partners the real ’’culprit” (if any(!) - industry prices are stochastically distorted) is never found 
out, (ii) punishment is collective which means that all players suffer from the high Cournot output 
during the punishment phase. 

In a cooperation project, however, partners will be able to (imperfectly) observe each other. Let 
us therefore assume that player i, at the end of each period t, receives a signal = aj + u\^ as 
to j’s performance (and v.v.). is serially uncorrelated to all other parameters; it is continuously 
distributed with E{u\^) = 0 and the variance V^{u\^) = V*(|a5l) with V*(0) = 0 and ^ > 0 (meaning 
the higher aj the higher the distortion to be expected). Players receive those signals at the end of 
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each period and use it to update their expectation at = as to the respective fellow-player’s 

performance in t and in all periods until T by way of simple adaptive learning 






= 9 



t-1 



<1 



+ ( 1 - 9 ) 



ai + u\^ 
a\ + uf 



( 2 ) 



0 < g < 1. Both the signal received and the current expectation a*t are z’s private knowledge. 
The run of events in each period can be summarized as 



• at the beginning of t, players have expectations 

• they simultaneously decide upon their actions aj, a\ which are unalterable throughout t\ 

• at the end of t, players receive actual payoffs and the signals aWai* which are used to update 
expectations. 



3. The Temptation to Defect 



Due to imperfect observability, also a player’s best-response can only be a based on the current 
expectation: 



«”(an = 



1 + 

2 



Player i may completely defect to that reaction strategy or he may partially defect to any a\ G 
]a**(o:J^); d*[. This presupposes 

Assumption 2: The optimum response is smaller than stipulated a*. 

This assumption seems unproblematic, for if a**(a*t) > a*, adherence to the agreement is not an issue 
any more and i will provide (at least) stipulated input even considering an isolated period. However, 
this presupposes either aP >> a* and more or less adherence by j or, alternatively, an excessively 
positive (and most presumably undeserved) ’’image” a\^ of j with i. The first alternative seems 
unrealistic - why should j enter into such a lop-sided agreement? Depending on the assumed density 
function for the signals also the second alternative seems improbable. Both cases will therefore not 
be considered in the following. Due to the quadratic payoff function and the fact that a'*{a\^) < d* 
there is a positive ’’temptation value” for all a\ G [a**(aj-^); d*[ 



^V;• = 7^(a{,a^-7^(d^a^ (3) 

Note that due to the quadratic property of the payoff function defection below the best response 
a**(aj^) does not pay. The question arises what happens, if both players completely defect playing 
their best responses. 

Proposition 1: Once both players completely defect, play degenerates to the Nash equilibrium. □ 
Proof can be found in the Appendix. In this context, the Nash equilibrium becomes the punishment 
for not being able to maintain cooperative behaviour in the joint project. Let us now examine how 
care for their reputation increases the players’ input and helps to maintain cooperative behaviour. 
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Assume that players, in case of mutual defection (or loss or trust which amounts to the same), expect 
immediate degeneration to the Nash equilibrium actions (which indicates the worst payoff they can 
get). 



4. Reputation Effect 

Let us consider the period decisions from rational (i.e., non-altruistic) z’s point of view working from 
the last period T backwards (but both players are exchangeable), rf^rp and therefore their altruism 
is the players’ private knowledge, nor is there any common prior as to the amount of altruists in the 
population. Instead, i has an initial expectation as to the performance to be expected by j ad 
personam. That expectation may be derived from earlier cooperation experience with the partner 
or from some impressions during the preparation of that particular project. In every real-world 
cooperation project, there will be some minimum input level which will be just acceptable without 
losing the other partner’s trust into one’s own correctness, reliability, etc. (Note: players are assumed 
to stay in the cooperation until T, the exit decision is therefore no issue in this paper). This level 
which is just enough to maintain the belief in the fellow-player’s potential altruism can be assumed 
common knowledge. Let 6a* and Sa^j 6 e denote these levels. Any expectation below that 
minimum level will cause a partner to lose the trust into his fellow-player for good. 

The last period, t = T: The altruistic j will defect if and only if she has lost her trust in z, i.e., 
0 ^ < Sa\ rational j and i will play their respective best response a**(a^), a^*(or), as it does not 
pay any more to maintain an altruistic ’’image”. 




The last but one period t = T-l: 

Consider Fig. 1 showing rational z’s decision. Imperfect information creates two information problems 
for i: 

• to estimate fs future performance (for which he uses a]^)\ in Fig. 1, j’s stipulated input would 
be a*^, but is still in the ’’permissible area”; 



• to estimate j’s opinion of himself, j may have already abandoned her belief in z’s potential 
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altruism, switched to a reaction strategy and it does not pay for i to invest in maintaining his 
image as potential altruist any more. 

Since i does not receive any information as to the current of*, he can only use permanent self- 
observation to estimate it: ej = qe\_i + (1 — q)a\_i. All a\ and therefore also e\ are z’s private 
knowledge. If i plays a\ in Fig. 1 and given his knowledge of (Hint: for the sake of a 

common scale on the x-axis Fig. 1 depicts ^(ot*|aJ) with a lower bound of zero and an expected 
value of aj; by shifting the density by a\ one gets (p{u\^\a\) with a lower bound of —a\ and an expected 
value of zero), i knows that any <1 < Sa^ or 



q(4' {1 - q){a\-\-u{") < 5a' 

or using self-observation qe\ + (1 — q){a\ + uj') < 5a' (4) 

will destroy his image as potential altruist with j in t + 1. This enables i to derive the probability of 
losing j’s trust in t + 1 as 



-a\ = 4' with = J ip{ul'\al)dwi' (5) 

-“i 

If i manages to maintain his image as potential altruist in t+1, he will be able to play his best response 
which (given his current expectation in t) will earn him payoffs of tt'^ = 7r*(a**(o'J'^), a'l) (Hint: in t, i 
assumes j to perform a'/ in t and all remaining periods until T). The player’s maximization decision 
will then be 



4”+l] = + (1 - P't+l)'^R] (6) 

with aj** denoting the optimum action. The behaviour modelled here does not seem unrealistic. By 
permanent self-observation i has built-up an estimator for j’s opinion of himself. Of course, i would 
prefer to play a**(o:J-^), but he knows that the lower his performance, the higher the risk of losing fs 
trust. If the considerations attributed to i in the model fail, they do so for the same reasons as in 
real-world situations: (i) z’s self-observation and fs opinion of i differ substantially (i.e., oP^' « ej) 
or (ii) the distortion uf is extremely negative. 

Proposition 2: Care for their reputation will increase the players’ performance, provided discovery 
of defection reduces the resp. player’s payoffs. □ 

Proof: We are still in t = T — 1. Denote 



dai 



which is non-linear (depending on the density function assumed) and negative (higher input in t 
decreases the probability of losing one’s image as potential altruist in t+1). The first-order condition 
for a\** in implicit form derived from (7) is 




325 



dTrL 



1^ = 1 + - 2a’ + - <i>UiTr)i = 0 

or rearranged 1 + s'a'^ = 2a| + [ttJj — irj,j 



(7) 



Given a\^ the left-hand side of (8) is a constant. (8) corresponds to the first-order condition in an 
isolated period plus the expression +(f>\^i [ttJj — Since the expression within the brackets will be 
positive and since is negative, the entire expression will be negative thereby increasing 

the level of a\ necessary to meet the equality, (cont.) □ 

Any period t <T: The maximization problem in (7) can readily be extended to a set of Bellman 
equations: 



4*11 = oi’) + E (4+1;T] I } (8) 

5. An Approximation 

There is a numerical solution to (9) for a given set of parameters, but let us make an approximation 
for a large number of remaining periods: 

Assumption 3: Players assume that their current optimal a]** will prevail in all periods until T. 
This is less myopic than it may seem: assume i increases input in t by Aa}. This will not only lower 
but also increases by (1 — ^)AaJ which in turn decreases the limit cj+i for any chosen 
thereby making detection as non-altruist by the partner less likely for any input level which in turn 
decreases the optimal input in t + 1. Therefore, depending on the exact form of the chosen density 
function optimum input will oscillate in a certain area, i will be aware that over time a\. (6) 
will therefore become 

_ n* r 

(Pt = -1 — ^ With r= ^{ur\ai)dur (9) 

i — ^ j 

i's optimization problem as stated in (9) then becomes 

4n = + t)w'^ + (1 - F% 

af) + F'(T - t - l)7r’,) + • • • (FV), + (1 - F‘)4 (a’*(an, a’>) • • •)) (10) 

or 4 ti = ""E V - a^) + (1 - FY~^n'^ + 

n=0 

+ ""E:‘ \l - F0"F’(T - t - n)4 (11) 

n=0 

The d-th sum in the first summation in (12) will be 5^ = (1 - F*)® + (1 - F*) H 1- (1 - 

Multiplying both sides by (1 — F‘) and substituting the limit value of the series one gets 

1 - F* 

(l-F’)S, = i^ 
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or 

Since we now consider t with a large number of remaining periods T - t, (1 - becomes 

negligible. Considering the second summation in (12) the d-th sum is (dividing by F*): 



^ = (l-FO“(T-f) + (l-r)‘(r-t-l)+ 

= -(1 - F‘)(T - 1) - (1 - F‘)^(T - 1 - 1)- 

-(1 - ( 12 ) 

The second line in (13) is derived by multiplying the first by —(1 — F*). Adding the two expressions 
in (13) yields Sd — d—{\ — F*) — (1 — F*)^ — (1 — F*)^~^ whose limit is 

^ l-F* 

d — . 

F* 

For large T — t, (9) can therefore be approximated as 

4:T] = I 

Proof Proposition 2 (cont.): The first-order condition for a\** is then 



or 1 + s'o^^ = 2aj + ^(7r*(aj, a)-’) - 7rj^) 



(13) 



This again corresponds to the first-order condition in an isolated period plus the reputation effect 

jri ^ ' 

As long as 7r*(a{, aj^) > and i stands to lose by being found a non-altruist, the entire expression 




is negative ((/>* < 0!), which increases the equilibrium a\ necessary to meet the equality. □ 
Considering (15) i basically faces the following trade-off when considering the effect of an increase in 
a\ beyond the best response a**(a*t): an increase in performance will lower the risk of being found 
a non-altruist F/^^ at the expense of excess marginal costs; due to the quadratic cost function that 
excess will increase. On the other hand, depending on the density function assumed, the effect of 
lowering F/^^ by providing additional input a\ will fade the more i increases his performance (cf., for 
example, the c.d.f. of a normal distribution). (15) describes the optimum self-interested a\** due to 
the reputation effect. It readily follows: 

Proposition 3: Stipulated actions a will be self-enforcing in all periods where (i) players still trust 
each other, i.e., a\^ > Sa^ and > 6aP, and (ii) a\** > a\a{** > aPD 
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Appendix 

According to (3) in the main text the system of expectations evolves as a\^^i — qa\^ = (1 — q){(4 + 

u\^) and (4Xi - qoPt = (1 - g)(aj H- uf ). Substituting best responses into (1) and transforming the 

system into matrix notation we get 



L 



«t+i 



-g 








r i=2+(i-9k] 










1 

rH 

+ 

1 



(1) 



or simply K at = d. The system reaches stable state if at+i = at = a meaning that 

a = (I-hK)-\d or 



aij 



1 



1 



{l-Q) 



2 _ ' 



l-q 

2 



2 

l-q 



.d = 



11 ^ + + (1 _ 

^ + (1 - 



This means that reaches stable state when 



( 2 ) 
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The expected value of all ut is zero, hence, reducing by (1 — q)^ (3) can be simplified to 

- 2^4 _ 

l-slf 4 - 

4 

which corresponds to the Nash equilibrium actions shown in (2) in the main text. It remains to be 
shown that the system of expectations also converges towards that point: 



X-g -11^ 
X-q 



= (X-qf- 



(1 — q)‘^s's^ 
4 



= 0 



with the characteristic roots 



Ai,2 



= q± 




(1 - q^s^s^ 
4 



Since s\ > 0, the roots in (4) are real. Finally, the convergence criterion is given by 



q-h- 



1 - 






< 1 or sV < 4. 



Since < 4 is required to avoid a deadlock and all games under consideration were assumed to 
meet that condition. Proposition 1 holds for all these games. □ 
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Dynamische Portfolio-Selektion in stetiger Zeit unter 
Beriicksichtigung von Kursspriingen 

Bernhard Nietert, Universitat Passau 

Summary: Jumps call for a completely new portfolio theory: we have to integrate jump risks 

explicitly into portfolio planning via hedge terms. Therefore, it is clearly a second-best solution 
either to ignore jumps as rare events at all or to simply adjust mean and standarddeviation of stocks 
and to continue applying the optimization formulas of the pure diffusion case. 



1 Einleitung 

1.1 Einfiihrung in die Problemstellung 

Das exakte Aussehen dynamischer Portfolio-Selektions-Modelle hangt stark vom unterstellten Ak- 
tienkursprozefi und damit der genauen Zahlungsstromverteilung ab. Insofern kommt der fundierten 
Auswahl des Aktienkursprozesses eine entscheidende Bedeutung zu. 

Wir legen unserem Beitrag kombinierte Sprung- und Diffusionsprozesse zugrunde: Dies geschieht zum 
einen aufgrund der empirischen Relevanz von Kursspriingen (fiir den US-amerikanischen Markt: 
Ball/Torous (1985) und Jorion (1988), fiir Deutschland: Trautmann/Beinert (1995) und Beinert 
(1997)), zum anderen aufgrund ihrer okonomischen Signifikanz: Die drastischen Kursausschlage an 
den Weltaktienmarkten, z.B. wahrend der Oktober-Crashes von 1987 und 1989 oder der siidostasiati- 
schen Krise 1997/98, fordern Antworten auf die Pragen, wie Crashes durch Portfolio-Planung mog- 
lichst neutralisiert werden konnen bzw. wie man die positiven Effekte von Kursexplosionen ausnutzen 
und sich noch vor ihnen engagieren kann. 

Interessanterweise haben trotz der offensichtlichen Bedeutung von Kursspriingen nur wenige Ar- 
beiten versucht, Spriinge in die Portfolio-Planung zu integrieren: Lediglich Aase (1984) sowie, mit 
Einschrankungen, Eastham/Hastings (1988), Hastings (1992) und Spremann (1997) nehmen eine 
Skizzierung vor, wobei das Gewicht der ersten drei Arbeiten auf einer rein technischen Darstellung 
kombinierter Sprung- und Diffusionsprozesse liegt: Weder prazisieren die Autoren die Sprungkom- 
ponente noch berechnen sie optimale Wert papier anteile. Spremann (1997) wiederum bildet praktisch 
den Gegenpol dazu: Er argumentiert in einem statischen Modellrahmen iiberwiegend intuitiv und 
weist anhand von Beispielen nach, dafi die klassische Portfolio-Theorie im Sprungfall nicht zielfiihrend 
ist. Allerdings lafit sich auch seinen Aussagen noch keine okonomisch fundierte Portfolio-Theorie 
unter Spriingen entnehmen. Deshalb versucht unser Beitrag, Kursspriinge und Portfolio-Selektion 
okonomisch miteinander zu verbinden. 
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1.2 Entwicklung des Modellrahmens 

Um die Wirkung von Spriingen herausarbeiten zu konnen, gehen wir von einer moglichst einfachen 
okonomischen Umwelt aus. Dies aufiert sich in dem unterstellten vollkommenen Kapitalmarkt, der 
Anfangsausstattung des Investors und seinen Praferenzen. Konkret: Wir bewegen uns in dem von 
Merton (1969) und (1971) entwickelten Modellrahmen: 

- Markte sind arbitragefrei, friktionslos und Entscheider agieren als Mengenanpasser. 

- Erlose aus Leerverkaufen sind vollstandig verwendbar und Anlagebeschrankungen nicht vorhan- 
den. 

- Investoren haben freien und gleichen Zugang zu bewertungsrelevanten Informationen. 

- Investoren verfiigen iiber keinerlei exogenes Einkommen, d.h. Vermogensveranderungen werden 
ausschliefilich durch Wertpapiertransaktionen und Konsum induziert. 

Ziel des Entscheiders ist es, im Zeitablauf den Konsumnutzen und im Planungshorizont den Verer- 
bungsnutzen durch (Konsum- und) Portfolio-Entscheidungen (Wahl aus einer sicheren Anlage und n 
nicht redundant en Aktien mit invert ierbarer Varianz/Kovarianzmatrix- alle Anlagen ohne Wahrungs- 
risiko) zeitstetig zu optimieren. Die Nutzenfunktionen seien 

- additiv separierbar (nur Konsumnutzenfunktion), stetig, streng monoton steigend und streng 
konkav bzgl. ihrer Argumente Konsum bzw. Endvermogen. 



2 Das erweiterte Modell dynamischer Portfolio- 

Entscheidungen: die Optimierung unter Kursspriingen 

2.1 Theoretische Analyse 

Die iiblicherweise in zeitstetigen Modellen unterstellten Diffusionsprozesse repr^entieren die “nor- 
male” Kursentwicklung der Aktie hervorgerufen durch (vgl. Merton (1976), S. 127) zeitweilige 
Angebots- und Nachfrageungleichgewichte, Anderung von Kapitalisierungsraten, Anderungen der 
okonomischen Aussichten, neue Informationen, die marginale Bewegungen des Aktienkurses nach 
sich ziehen, usw. 

Spriinge bilden dagegen kurzfristige Kursveranderungen infolge aufiergewohnlicher Vorgange explizit 
ab. Dabei bezeichnen wir den Fall negativer Kursanderungen als Crash oder Kurseinbruch (C) und 
den positiver als Kursexplosion (E). 

Unserer Betrachtung legen wir sogenannte “firmenspezifische Spriinge”, d.h. Spriinge, die nur die 
Aktien eines Unternehmens betreffen und die Aktien anderer Unternehmen unberiihrt lassen, zu- 
grunde. Ein aktuelles Beispiel stellt die Telekomaktie dar, deren Kurs nach der Bilanzkonferenz am 
29.04.1998 um 9,7% stieg, ohne die Kurse anderer Aktien zu beeinflussen. 

Uber den Weg “firmenspezifischer Spriinge” lafit sich am besten die Sprungkomponente verdeut- 
lichen, verfiigt doch jede Aktie fiir sich genommen iiber eine eigene Sprungmoglichkeit. Dies ist 
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bei der Betrachtung von langer wahrenden Borsenkrisen, wie z.B. in Siidostasien, eine hilfreiche 
Eigenschaft, da auch in einem solchen Fall nicht jeden Tag ein Borsencrash eintritt, sondern Son- 
derbewegungen zu beobachten sind. - Fiir eine Analyse anderer Sprungarten ( “gruppenspezifische” 
und “Marktspriinge” ) diirfen wir auf Nietert (1996) oder Nietert (1997) verweisen. 

Formal lassen sich die bereits verbal erarbeiteten speziellen Eigenschaften von “firmenspezifischen 
Spriingen” (Auftreten nur zu endlich vielen Zeitpunkten und Variation in nicht infinitesimalem Um- 
fang) folgendermafien modellieren: Zum einen miissen die nicht laufend eintretenden Kursanderungen 
iiber Sprungeintrittswahrscheinlichkeiten eingefangen werden, zum anderen benotigen wir ein Modell 
der nicht infinitesimal kleinen Anderungen, der sogenannten Sprungamplituden. 
Sprungeintrittswahrscheinlichkeiten sind bei “firmenspezifischen Spriingen” als die Wahrscheinlichkeit 
eines alleinigen Sprungs von Aktie j definiert und werden von uns, der Anschaulichkeit willen, fiber 
einen Poisson-Prozefi mit konstanter Intensitat A abgebildet. Damit konnen wir Sprfinge fiber ein 
einfaches Schaltermodell; “Sprung tritt ein/Sprung tritt nicht ein” modellieren. Sprfinge nehmen 
freilich zwei verschiedene Auspragungen (Crash und Explosion) an. Wir prazisieren deshalb die 
soeben beschriebenen Sprungeintrittswahrscheinlichkeiten noch weiter: In der Periode zwischen t 
und t + dt tritt mit Wahrscheinlichkeit 1 — Xcjdt — Xsjdt kein “firmenspezifischer Sprung” (Wert- 
papier j folgt der Diffusionskomponente), mit Wahrscheinlichkeit XEjdt eine ( “firmenspezifische” ) 
Explosion und mit Wahrscheinlichkeit Xcjdt ein (“firmenspezifischer”) Crash auf. - Eine derartige 
Sichtweise hat neben der rein theoretischen Konst rukt ion auch praktische Relevanz: In volatilen 
Markten kann man oft nur starke Schwankungen prognostizieren, deren Richtung aber unklar ist. 
Ffir die Sprungamplituden gilt - ebenfalls aus didaktischen Grfinden dafi sie im Zeitablauf un- 
abhangig und identisch verteilt sind. Unter “firmenspezifischen Spriingen” sind dar fiber hinaus die 
Sprungamplituden zweier Aktien wechselseitig unkorreliert. 

Folglich erhalten wir als Vermogensveranderung in der Periode d t im Sprung/Diffusionsfall 



dW{t) = w^{t){a — lr)W{t)dt~\- {rW{t) — C{t))dt-\-W{t)w^{t)adz{t) 



( 1 ) 



mit Wahrscheinlichkeit 1 — ( 5^ A^;^ + ^ Xcj J dt (Diffusionsfall) 

\j=i j=i 






j=i 



/ n ” \ 

einlichkeit ^ Xej + ^ Xcj 
\j=i j=i / 



mit Wahrscheinlichkeit ( 5^ ^ Xcj dt (Sprungfall) 




334 



^ IVanspositionszeichen eines Vektors oder einer Matrix 

W (t) Vermogen in t 

w(t) n X 1 Vektor der Wertpapieranteile in t 

a n X 1 Vektor der im Zeitablauf konstanten Momentanerwartungs- 

werte der ex-Dividendenrenditen 
r risikoloser Momentanzinssatz 

1 n X 1 Vektor, dessen Komponenten alle den Wert Bins aufweisen 

d t infinitesimale Ver^derung der Zeit 

C(t) Konsum pro Zeiteinheit (“Momentankonsum”) im Zeitpunkt t; der 
Konsumbetrag kann nur fur Perioden und nicht fur Zeitpunkte er- 
mittelt werden (zwischen den Zeitpunkten t und betragt er 

C(t)dt). 

G n X n Diagonalmatrix der im Zeitablauf konstanten Momentan- 

standardabweichungen der ex-Dividendenrenditen 
dz{t) n X 1 Vektor der Inkremente korrelierter Wiener-Prozesse; es gilt: 

dzi dzj = Tjij {rjij: Korrelatidnskoeflizient zwischen Papier i und j) 

(pkj Zufallsvariable, die die Sprunghohen des Wertpapier j beschreibt, 
wenn ein Sprung der Klasse k (k = E, C) eintritt; der Kursverlust 
wird nach oben durch den aktuellen Kurs der jeweiligen Wertpapiere 
begrenzt, d.h. \(pcj\ < 1- 

Gleichung (1) fangt folgende Uberlegung ein: Zu alien Zeitpunkten folgt das Vermogen den “nor- 
malen” Veranderungen verursacht durch eine geometrisch Brownsche Bewegung, zu bestimmten 
“Sprungzeitpunkten” unterliegt es zusatzlichen Kursspriingen. Da diese “Sprungzeitpunkte” ex ante 
unbekannt sind, werden sie iiber die zuvor beschriebenen Eintrittswahrscheinlichkeiten eingefangen. 
Dabei ist zu beachten, dafi sich nichtinfinitesimale Veranderungen des Vermogens durch Spriinge 
bei jeder der n Aktien ergeben konnen. Damit postulieren wir nicht einfach die nichtinfinitesimale 
Vermogensveranderung, sondern leiten sie aus den Einzelbestandteilen des Vermogens - den einzelnen 
Aktien - her. 

Natiirlich bringt es die Modellierung der Eintrittswahrscheinlichkeiten als Poisson-Prozefi mit sich, 
dafi die Wahrscheinlichkeit fiir das gleichzeitige Auftreten zweier Spriinge vernachl^sigbar klein ist. 
Von daher werden die einzelnen Wahrscheinlichkeiten fiir die “firmenspezifischen Spriinge” einfach 
aufsummiert. Schliefilich gehen wir noch von vom “normalen” Aktienkurs unabhangigen propor- 
tionalen Spriingen aus. Die Unabhangigkeit beriicksichtigt den aufiergewohnlichen Charakter der 
Spriinge im Vergleich zum “normalen” Kursverlauf des Unternehmens. 

Mit den nun sowohl okonomisch als auch formal fundierten Spriingen konnen wir die Portfolio- 
Optimierung angehen: Eine Portfolio-Strategie ist dann optimal, wenn der Erwartungsnutzen des 
Entscheiders durch weiteres Umschichten nicht mehr gesteigert werden kann. Dieses Konzept formal- 
isieren wir mit Hilfe der um Spriinge erweiterten Hamilton/ Jacobi/Bellman-Gleichung samt Randbe- 
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dingung in t = T (vgl. z.B. Merton (1971)). Differenzieren wir die Hamilton/ Jacobi/Bellman- 
Gleichung nach den Wertpapieranteilen und formen um, ergibt sich als optimale Portfolio-Strategic 



w(t) = - 



Jw 

JwwW(t) 



— Ir) 



( 2 ) 



1. Term 



-E 



^ _ TTTT-r n 



JwwW(t) 



/ 0 \ 

Xej 

V 0 / 



2. Term 



+E 

j=i 



^ - Wj(t)ipcj)W{t),t I ^ ^ 



Jww^ (t) 



/ 0 \ 

^Cj 

\ 0 / 



3. Term 



Et{^EjJw\jE]} 
/ 0 \ 

^kj 

Vo/ 



Kurzschreibweise for ^]} 

n X 1 Vektor mit Xkj als j-ter Komponente und Null an 
alien anderen Stellen; k = E,C 



Die optimalen Wertpapieranteile bestehen aus zwei Grundbestandteilen: dem bekannten /x-cr-effi- 
zienten Portfolio der “normalen” Kursbewegungen (1. Term) und Korrekturtermen (2. Term: Ex- 
plosion und 3. Term: Crash), die den Kursspriingen Rechnung tragen. 

Die Korrekturterme zerfallen ihrerseits in zwei Teile: zum einen in eine investorunabhangige Struk- 

/ 0 \ 



turkomponente ft ^ 



^kj 



, die die durchschnittliche Anzahl von Spriingen pro Zeiteinheit dem 



V 0 / 

Risiko aus der “normalen” Preisbewegung gegeniiberstellt. Da die Strukturkomponente die j-te 

Spalte aus der inversen Varianz/Kovarianzmatrix gewichtet mit Xkj extrahiert, sieht man, dafi die 

Korrekturterme nicht ausschliefilich aus einer Anteilsveranderung des von Spriingen betroffenen Pa- 

piers bestehen. Zum anderen in eine investorabhangige volumenmaBige Gewichtung , 

JwwW (t) 

die die Relevanz dieser Spriinge anhand von Erwartungsnutzenabwagungen vor und nach Spriingen 



beurteilt. Diese unterschiedlichen volumenmafiigen Gewichtungen — 



Jw 



bzw. 



Et{Jw[j]} 



Jww^{t) Jww^{t) 
stellen auch sicher, dafi sich die Korrektur-Portfolios nicht zu einem einheitlichen Portfolio zusam- 

menfassen lassen. Die Optimierung unter Spriingen verlangt deshalb eine vollig neue Portfolio- 
Strategic: Es reicht nicht aus, Spriinge als seltene Ereignisse vollig zu ignorieren oder lediglich die 
Wertpapiercharakteristika hinsichtlich Spriingen zu modifizieren und die Optimierungskalkiile “nor- 
maler” Kursbewegungen anzuwenden, sondern Spriingen ist durch explizite Korrekturterme Rech- 
nung zu tragen. Genauer gesagt gibt es fur den Sprung eines jeden Papiers j einen eigenen Kor- 
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rekturterm j. Fiir eine weitergehende theoretische Interpretation der optimalen Wertpapieranteile 
diirfen wir auf Nietert (1996) und Nietert (1997) verweisen. 

2.2 Numerische Analyse 

Illustrieren wir die bisherigen theoretischen Ergebnisse noch numerisch: Da die Berechnungen aus- 
schliefilich Verdeutlichungszwecken dienen, fiihren wir sie in einem moglichst einfachen Modellrahmen 
durch: Wir wahlen den Zwei-Wertpapier-Fall. Zudem lassen sich in diesem Umfeld die Korrelations- 
beziehungen zwischen den “normalen Risiken” der Papiere eindeutig steuern und dariiber hinaus 
Ergebnisse leicht okonomisch interpretieren. Aufierdem verwenden wir konstante Sprungamplitu- 
den (als einfachstes Modell der Sprunghohe). Dadurch werden gut extreme Szenarien abbildbar: 
eine Amplitude, die wohl die Untergrenze fiir einen Sprung darstellt (1%), und ein sehr heftigen 
Kursausschlag (50%). 

Schliefilich mufi eine konkrete Nutzenfunktion vorgegeben werden, um die volumenmafiigen Gewich- 
tungen explizieren zu konnen. Wir greifen dazu auf eine isoelastische Konsum- und Vermogens- 
nutzenfunktion zuriick, die zu folgendem optimalen Wertpapieranteil des Papiers k fiihrt (fiir eine 
Anwendung der Formeln ist zu beachten: Im Fall der Explosion gilt und (pk = 

Fall des Crashes: Xk = Xc und (fk = —^c)- 

1 2 ^ 2 

Wi{t) = - ” ^) + ^ ^ (^) 

r j=i ^ 7 j=i 

Vij j-tes Element der i-ten Zeile der inversen Varianz/Kovarianz-Matrix 

1 — 7 relative Risikoaversion 

Die Zahlen des Anhangs zeigen, dafi Sprungterme einen erheblichen Einflufi auf die optimalen Wert- 
papieranteile haben: Bereits bei relativ niedrigen Sprungwerten (z.B. 5%ige Eintrittswahrschein- 
lichkeit und l%ige Amplitude) weichen die optimalen Wertpapieranteile ohne und mit Sprung deut- 
lich voneinander ab, wobei dieses Resultat sowohl fiir schwache als auch fiir starker ausgepragte 
Risikoaversion gilt. - Im iibrigen lafit sich aus den identischen Vorzeichen bei unterschiedlicher 
Hohe der Wertpapieranteile fiir verschiedene Risikoaversionen (Zeilen der Tabelle) die Trennung in 
investorunabhangigen Strukturterm und in investorabhangige volumenmafiige Gewichtung ablesen. 
Dariiber hinaus wird ebenfalls klar, dafi zu niedrige Eintrittswahrscheinlichkeiten (« Strukturterm) 
den Effekt von Spriingen stark untergewichten und bedeutend schwerer wiegen als eine zu niedrige 
Amplitude (« volumenmafiige Gewichtung). Z.B. wurde die Eintrittswahrscheinlichkeit von 5/36% 
um das 36-fache auf 5% erhoht, die Amplitude um das 50-fache von 0,5 auf 0,01 abgesenkt. Trotzdem 
lagen die Wertpapieranteile von Papier 1 bei der Kombination (5/36%; 0,5) deutlich unter denen bei 
(5%; 0,01). 

Schliefilich sei noch die enorme Bedeutung der Wechselwirkungen von Spriingen mit den “normalen 
Risiken” anhand unserer Beispielsdaten verdeutlicht. Die “normalen Risiken” fiihren dazu, dafi 
Papiere bei Crashes nicht sofort leerverkauft werden bzw. bei Explosionen ausschliefilich in das ex- 
plodierende Papier investiert wird. Vielmehr sind komplementare und substitutive Beziehungen zu 
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beobachten: Bei positive! Korrelation der “normalen Risiken” von Papier 1 und 2 lost ein Crash bei 
Papier 1 eine Anteilserhohung von Papier 2 (und der sicheren Anlage) finanziert durch Anteilssenkun- 
gen bei Papier 1 aus. Beide unsicheren Papiere stehen namlich in einer substitutiven Beziehung: 
Eine erhohte Korrelation verschlechtert die Diversifikation der “normalen Risiken” und steigert das 
Gesamtrisiko. Da Spriinge unabhangig vom “normalen Risiko” sind, bleibt die grundsatzliche sub- 
stitutive Beziehung aus den “normalen Risiken” auch im Sprung/Diffusionsumfeld erhalten. Bei 
negative! Korrelation der “normalen Risiken” senkt man die Anteile beider Papiere zugunsten der 
sicheren Anlage. Ursachlich ist in diesem Fall die komplementare Beziehung beider Papiere: Durch 
die gegenlaufigen Kursbewegungen unter “normalen Risiken” werden beide Papiere zur Reduktion 
der Portfolio- Varianz unter die der einzelnen Papiere kombiniert. Wegen der Zusammenarbeit 
der Papiere verspricht der Besitz eines einzelnen Papiers nur einen geringeren “Nutzen” und die 
beobachteten Anteilsanpassungen zur Aufrechterhaltung der Hedge-Wirkung gegen die “normalen 
Risiken” erfolgen. - Falls dagegen beide Papiere unkorreliert sind, alimentiert die Crash-induzierte 
Anteilssenkung bei Papier 1 ausschliefilich die sichere Anlage; der Anted von Papier 2 bleibt konstant, 
da weder von der Diffusions- noch der Sprungseite eine Verbindung zu Papier 1 besteht. 

Analoge Aussagen gelten fiir Kursexplosionen: Substitutive Beziehungen fiihren zu einer Umschich- 
tung in das explodierende Papier 1 auf Kosten des nicht springenden Papiers 2 und der sicheren 
Anlage, komplementare Beziehungen zu einer simultanen Anteilserhohung beider Papiere auf Kosten 
der sicheren Anlage. Bei Unkorreliertheit andert sich der Anteil von Papier 2 nicht. 

Im iibrigen lassen die Extremdaten 100% Eintrittswahrscheinlichkeit und 50% Amplitude bei hoher 
Risikoaversion das (auch theoretisch ermittelbare, vgl. Nietert (1996), S. 88) Konvergenzverhalten 
gegen die Wertpapieranteile unter “normalen Risiken” erkennen. Die Wertpapieranteile sind namlich 
deutlich weniger extrem als im Fall einer Eintrittswahrscheinlichkeit von 5% und 50% Amplitude. 

3 Zusammenfassung 

Ziel unseres Beitrags war es, eine okonomisch fundierte Integration von Kursspriingen in die Portfolio- 
Selektion vorzunehmen. Dabei haben wir durch Berechnung optimaler Wertpapieranteile die Subop- 
timalitat einer Wertpapierstrategie bewiesen, die Spriinge als seltene Ereignisse vollig ignoriert oder 
lediglich Erwartungswerte und Varianzen/Kovarianzen der Aktien hinsichtlich Spriingen modifiziert 
und auf die neuen Wertpapiercharakteristika Optimierungskalkiile wie unter Diffusionsprozessen an- 
wendet. Vielmehr erfordern Kursausschlage eine vollig eigenstandige Umschichtungsstrategie: Das 
optimale Portfolio stellt eine Linearkombination aus dem /x-cr-efl5zienten Portfolio der Diffusion- 
skomponente und sprunginduzierten Korrekturtermen dar. 
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Anhang: Numerische Analyse 

Tabelle der Wertpapiergrunddaten: 



Wertpapier 1 


Wertpapier 2 


sichere Anlage 


ai = 0,11 


ai = 0,2 


02 = 0,12 


(T2 = 0,22 


r = 0,08 



Tabelle der Wertpapieranteile: 



ausschliefilich “normale Risiken” 


o 

II 


o> 

1 

II 


iji2 = 0.7 


o 

II 


to 

II 

1 

p 


7/12 = 0.7 


o 

II 

<N 


to 

II 

1 

O 


wi = 0, 2228 


wi = 0,75 


u;i =2,71836 


wi = 0,02228 


wi = 0,075 


Wi =0,27184 


W2 = 0, 68465 


W2 = 0,8264 


W2 = 2, 55631 


W2 = 0,068465 


«;2 =0,082645 


W2 = 0, 25563 


Wo = 0, 09253 


wo = -0,57644 


wo = -4,27467 


wo = 0,90953 


wo = 0,84235 


Wo = 0, 47253 



“normale Risiken” und zusatzliche Kurseinbriiche 


o 

II 


o> 

1 

II 


rfi2 = 0.7 


o 

II 


lO 

II 

1 

p 


f/i2 = 0.7 


II 

o 


Vl2 = -0.7 




mit Eintrittswahrscheinlichkeit 


= 5/36% und Amplitude = - 1% 




wi = 0, 19825791 
W2 = 0,70028216 
two = 0, 10146 


wi = 0, 737407 
W2 = 0, 82645 
wo = —0, 563853 


wi =2,693172 
W2 =2,540283 
ti/Q " 23345 


wi =0,019826 
W2 = 0,07003 
wo =0,910146 


Wi = 0,0737407 
W2 = 0,082645 
wo = 0,83614627 


wi = 0, 2631803 
W2 = 0, 25402882 
wo = 0, 4766531 




mit Eintrittswahrscheinlichkeit 


= 5/36% und Amplitude = -50% 




tyi = -0,688752 
W2 = 1,264743 
wo = 0,4240089 


wi =0,0941312 
W2 = 0, 82645 
wo = 0,079423 


Wi = 0, 752945 
W2 = 1,30559 
wo = -1,058538 


wi = —0,066205 
W2 = 0, 124775 
wo = 0,9414298 


wi = 0,00946404 
W2 = 0,082645 
wo = 0,9078913 


wi = 0, 083809 
W2 = 0, 135977 
wo = 0, 780214 


mit Eintrittswahrscheinlichkeit = 5% und Amplitude = -1% 


lui = -7,95084 
W2 = 5, 88607 
wo = 3,064768 


wi = -3,59388709 
W2 = 0, 82645 
wo = 3, 76744 


wi = -5,6345241 
W2 = -2, 75916 
wo = 9, 39368 


wi = -0, 79305377 
W2 = 0,5873152 
wo = 1,205739 


wi = -0,35915284 
W2 = 0,082645 
wo = 1,276508 


wi = -0, 56239487 
W2 = -0, 27524705 
wo = 1,837637885 


mit Eintrittswahrscheinlichkeit = 5% und Amplitude = -50% 


Wi = -28, 606306 
W2 = 19,0304592 
wo = 10, 575847 


wi = -19,8474086 
W2 = 0,82645 
wo = 20,02096 


wi = -27,34742 
W2 = -16,57646 
wo = 44,923874 


wi = -0,933858 
W2 = 0,6769179 
wo = 1,25694 


wi = -0,7752089 
W2 = 0,082645 
wo = 1,6925643 


wi = -0,8797861 
W2 = -0,477219 
Wo = 2, 357005 


mit Eintrittswahrscheinlichkeit = 100% und Amplitude = -1% 


Wi = -91, 789721 
W2 = 59, 2381 
wo = 33,551634 


wi = -56,68872 
W2 = 0, 82645 
wo = 56,862276 


wi = -90, 1081 
W2 = —56,5151 
wo = 147,623154 


wi = -8,0865236 
W2 = 5,228614 
Wo = 3, 85791 


wi = -5,296562 
W2 = 0,082645 
wo = 6,2139174 


wi = -7,9443813 
W2 = -4,972871 
wo = 13,91725 




mit Eintrittswahrscheinlichkeit 


= 100% und Amplitude = -50% 




wi = -131,733797 
W2 = 84,6570441 
wo = 48,07675 


wi = -93,49339 
W2 = 0, 82645 
wo = 93,66894 


wi = -130,48354 
W2 = -82, 208532 
wo = 213,692071 


wi = -1,089593 
W2 = 0,0684654 
wo = 0,90925296 


wi = -0,966364 
W2 =0,082645 
wo = 1,8837197 


wi = -1,685009 
W2 = -0, 9863398 
wo = 3, 6746432 




339 



“normale Risiken” und zusatzliche Kursexplosionen 


o 

II 


7 = -9 


7712 = 0.7 


T}i2 = 0 


d 

1 

II 


r)i2 = 0.7 


0 

II 


to 

II 

1 

p 




mit Eintrittswahrscheinlichkeit 


= 5/36% und Amplitude = 1% 




wi =0,466776 
W2 = 0, 529407 
two = 0,003817 


toi = 0,87392 
t02 = 0, 82645 
too = -0, 70036 


toi =2,9564 
t02 = 2, 7078 
too = —4,664 


toi = 0,046677 
t02 = 0,052941 
too = 0, 90038 


toi =0,087391 
t02 =0,082645 
too = 0, 82996 


toi = 0, 295633 
t02 =0,27077 
too = 0> 43359 




mit Eintrittswahrscheinlichkeit = 


= 5/36% und Amplitude = 50% 




wi = 4, 185364 
W2 = -1,83697 
wo = -1,34839 


toi = 3, 1685 
t02 = 0, 82645 
too = —2,9949 


toi = 5, 8343 
t02 =4,54491 
too = -9, 3882 


toi =0,311041 
t02 = -0, 11529 
too = 0, 80425 


toi = 0, 25951 
t02 =0,082645 
too = 0,657841 


toi =0,439766 
t02 = 0, 362496 
too = 0, 19774 


mit Eintrittswahrscheinlichkeit = 5% und Amplitude = 1% 


Wi = 8, 36522 
W2 = -4, 49687 
too = —2, 86834 


toi = 5,0343 
t02 = 0, 82645 
too = —4,8608 


toi = 10,68977 
t02 = 7, 629 
too = -17,319 


toi =0,83429 
t02 = -0,4482665 
too = 0, 61398 


toi = 0, 50298 
t02 =0,082645 
too = 0,414376 


toi = 1,06546 
t02 = 0, 76066 
too = -0, 826125 




mit Eintrittswahrscheinlichkeit 


= 5% und Amplitude = 50% 




toi = 28, 83662 
W2 = -17,5241 
too = -10,31249 


toi = 20, 632719 
t02 = 0, 82645 
too = -20,4592 


toi = 30, 15714 
t02 = 20,01735 
too = -49, 1745 


toi = 0, 944481 
t02 = -0,518389 
too = 0,5739078 


toi = 0,81428 
t02 = 0,082645 
too = 0, 10308 


toi = 1,0105 
t02 = 0, 72572 
too = -0, 736254 




mit Eintrittswahrscheinlichkeit 


= 100% und Amplitude = 1% 




toi = 92, 09221 
102 = -57, 777685 
too = -33, 314525 


toi = 57,78841 
t02 = 0,82645 
too = -57,6148562 


toi =93,78398 
t02 = 60, 5071621 
too = -153,29114 


toi =8,112 
t02 = -5,07954 
too = -2,03246 


toi = 5,36064838 
t02 = 0, 082645 
too = -4,47871 


toi = 8,2551867 
t02 = 5, 33594 
too = -12,591132 




mit Eintrittswahrscheinlichkeit 


= 100% und Amplitude = 50% 




toi = 131,95829 
t02 = -83, 14701 
too = -47,8113 


toi = 94,25329375 
t02 = 0, 82645 
too = -94,07974 


toi = 133, 22236 
t02 = 85, 6043 
too = -217,8267 


toi = 1,734541 
t02 = -1,021154 
too = 0, 286613 


toi = 1,54502 
t02 =0,082645 
too = -0, 627666 


toi = 1,781793 
t02 = 1,21651298 
too = -1,9983 
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Optionsbewertung mit Hilfe der genetischen 
P r ogrammierung 



Christian Keber, Universitat Wien 

Zusammenfassung: In diesem Beitrag werden analytische Approximationen zur Bewertung von 

amerikanischen Aktienverkaufsoptionen ohne Dividenden wahrend der Restlaufzeit der Option er- 
mittelt, wobei als in diesen Zusammenhang neues methodisches Werkzeug die genetische Program- 
mierung herangezogen wird. Auf der Basis umfangreicher Datensatze kann gezeigt werden, dafi 
alle genetisch ermittelten analytischen Ndherungslosungen alternative, in der Literatur vorgestellte 
analytische Approximationsformeln dominieren und zu signifikanten Verbesserungen bei der approxi- 
mativen Optionsbewertung fiihren. Zudem kann in diesem Beitrag als Nebenprodukt ein analytischer 
Ausdruck gewonnen werden, der den numerisch ermittelten, “exakten” Killing Preis recht gut ap- 
proximiert. 

1. Einfuhrung und Problemstellung 

Im Jahre 1973 wurden von Black und Scholes [6] sowie Merton [16] Arbeiten vorgelegt, 
in denen die analytisch exakte Losung fiir die Bewertung von Europaischen Aktienkaufoptionen 
ohne Dividende wahrend der Restlaufzeit der Option pr^entiert wurde. Merton [16] hat zu- 
dem gezeigt, dafi eine vorzeitige Ausiibung von amerikanischen Aktienkaufoptionen ohne Divi- 
dende wahrend der Restlaufzeit der Option nicht optimal ist und die Bewertung ebenfalls fiber den 
BLACK/SCHOLES/MERTON-Ansatz erfolgen kann. Werden bei amerikanischen Aktienkaufoptionen 
wahrend der Restlaufzeit (diskrete) Dividenden gezahlt und sind diese mit Sicherheit bekannt, so 
kann ihr Wert fiber das analytisch exakte Bewertungsmodell von Roll [17] bestimmt werden. 

Im Vergleich zu amerikanischen Aktienkaufoptionen ist bei amerikanischen Aktienverkaufsoptionen 
ohne Dividende wahrend der Restlaufzeit der Option eine vorzeitige Ausiibung der Optionen wahrend 
der Restlaufzeit vorteilhaft, falls der Aktienkurs unter einem bestimmten, sich permanent andernden 
kritischen Wert fallt {Killing Preis). Als Folge des Unterschiedes bei der vorzeitigen Ausiibung 
von amerikanischen Aktienkauf- und -verkaufsoptionen ist die Bestimmung der optimalen (vorzeit- 
igen) Ausiibung bzw. des Killing Preises Bestandteil des zu losenden Bewertungsproblems, woffir 
bislang kein analytisch exaktes Modell gefunden worden ist. Die Bewertung von amerikanischen 
Aktienverkaufsoptionen erfolgt daher entweder mit Hilfe numerischer Losungsverfahren oder fiber 
analytische Approximationen. Die bekanntesten numerischen Verfahren sind der Binomial- Ansatz 
[8] und die Finite Differenzen Methode [7]. Der wesentlichste Nachteil von numerischen Verfahren 
zur Bewertung von amerikanischen Aktienverkaufsoptionen ist darin zu sehen, dafi sie rechenintensiv 
sind. Eine einfache analytische Approximation zur Bewertung von amerikanischen Aktienverkaufsop- 
tionen ohne Dividende wahrend der Restlaufzeit wurde von Johnson [13] vorgestellt und besteht 
in einer Linearkombination zwischen den Werten zweier Europaischer Aktienverkaufsoptionen mit 
unterschiedlichen Basispreisen (sonst jedoch identischen Bestimmungsparametern). Diese einfache 
analytische Approximation hat den Nachteil, dafi die Approximationsgfite nicht besonders hoch ist, 
sodafi anspruchsvollere analytische Approximationen entwickelt worden sind. Die bekanntesten von 
ihnen stammen von MacMillan sowie von Geske und Johnson. 
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Die von MacMillan [15] vorgestellte analytische Approximation zur Bewertung amerikanischer 
Aktienverkaufsoptionen besteht darin, den Wert einer (entsprechenden) Europaischen Aktienverkauf- 
soption um den approximativ bestimmten Wert einer friihzeitigen Ausiibung der Option zu erhohen. 
Dieser Ansatz bildete in der Folge die Basis fiir analytische Approximationen zur Bewertung einer 
Reihe von anderen amerikanischen Optionen ([19], [4]). Die analytische Approximation von Geske 
und Johnson [10] basiert auf der Annahme, dafi eine friihzeitige Ausiibung der Option nur zu 
bestimmten diskreten Zeitpunkten erfolgen kann. Demgemafi wird fiir jeden dieser Zeitpunkte ein 
Optionswert berechnet und darauf basierend durch Verfahren der polynomialen Extrapolation der 
Wert der amerikanischen Aktienverkaufsoption. Der von Geske und Johnson prasentierte Ansatz 
behandelt die Bewertung von amerikanischen Aktienverkaufsoptionen sowohl mit als auch ohne Div- 
idenden wahrend der Restlaufzeit der Option. Der MACMiLLAN-Ansatz hingegen beschrankt sich 
nur auf Optionen ohne Dividenden wahrend der Restlaufzeit, wurde aber fiir den Fall mit Dividenden 
von Barone-Adesi und Whaley [5] sowie von Fischer [9] erweitert. 

Im vorliegenden Beitrag werden analytische Approximationen zur Bewertung amerikanischer Aktien- 
verkaufsoptionen ohne Dividenden wahrend der Restlaufzeit ermittelt, wobei als in diesem Zusam- 
menhang neues methodisches Werkzeug die genetische Programmierung herangezogen wird. Auf der 
Basis umfangreicher Datensatze kann gezeigt werden, dafi die genetisch bestimmten analytischen Ap- 
proximationen zu Bewertung amerikanischer Aktienverkaufsoptionen ohne Dividenden wahrend der 
Restlaufzeit alternative, in der Literatur vorgestellte approximative Ansatze dominieren und zu sig- 
nifikanten Verbesserungen bei der approximativen Optionsbewertung fiihren. Im zweiten Abschnitt 
des Beitrages wird die genetische Programmierung vorgestellt, der dritte Abschnitt enthalt konkrete 
Implementierungsaspekte und prasentiert die genetisch bestimmten analytischen Approximationen, 
und im vierten Abschnitt werden die Anwendungsergebnisse dargelegt und diskutiert. Der Beitrag 
endet mit einer Zusammenfassung und einigen abschliefienden Bemerkungen. 

2. Genetische Programmierung 

Die grundlegenden Arbeiten fiber die genetische Programmierung stammen von KOZA [14] und 
basieren auf genetischen Algorithmen, die erstmals von Holland [12] vorgestellt und in der Folge zur 
Losung einer grofien Anzahl unterschiedlicher Probleme erfolgreich eingesetzt wurden [2]. Genetis- 
che Algorithmen wie auch die genetische Programmierung sind parallele, zufallsprinziporientierte 
Suchverfahren mit fiberwiegend heuristischem Charakter und basieren auf Prinzipien der Evolution 
und Vererbung. Die grundlegende Idee liegt in der Imitation der den Evolutionsprozefi ausmachenden 
Strategie, dafi die besten an die Umwelt angepafiten Individuen einer Population fiberleben und ihr 
Erbmaterial an die nachste Generation weitergeben. Im Laufe der Evolution entstehen auf diese Art 
Populationen, die immer besser an die Umwelt angepafit sind. Fiir die vorliegende Problemstellung 
geht man von einer Programmiersprache C aus, durch die die Menge aller analytischen Ausdrucke 
beschrieben werden kann, die aus den Elementen einer Terminalsymbolmenge T aufgebaut werden 
kann [1]. Im Sinne der genetischen Programmierung sind C der Suchraum aller potentiell gener- 
ierbaren analytischen Ausdrficke, c £ C ein Chromosom oder Individuum^ Pt C C die Population 
der r-ten Generation (r = 0, . . . , Tmax)? und die in c enhaltenen Terminalsymbole reprasentieren die 




343 



Gene des Chromosoms. 

Zur Beurteilung, wie gut ein Individuum an die Umwelt angepafit ist (d.h. wie gut ein analytischer 
Ausdruck den wahren Optionspreis approximiert), wird eine Fitnefifunktion f :A-^ M definiert, die 
die genetische Struktur eines Chromosoms (analytischen Ausdrucks) bewertet. Ausgangspunkt dabei 
ist die Uberlegung, dafi jedes Individuum c G £ als Funktion c \Si ^ Ai zn begreifen ist, weil es 
Eingabedatensatze Si verarbeitet und in Ausgabedaten Ai transformiert. Da die Fitnefi als Mafi fiir 
die Angepafit heit eines Individuums an seine Umwelt dient, sind die Ausgabedaten Ai mit Hilfe einer 
geeigneten Funktion A(^,^f) mit entsprechenden Soli- Ausgabedaten Af (Umwelt) zu vergleichen. 
Wird nun A(^, Af) als Abweichungsfunktion derart gestaltet, dafi sie umso hohere Werte liefert je 
mehr sich die Ausgabedaten von den Soil- Ausgabedaten unterscheiden, so erhalt man mit 

/r(c) = mit Ai = c{£i) (1) 

t=l 

die Rohfitnefi des Individums c. Fiir sie ist charakteristisch, dafi dem kleineren Fitnefiwert das bessere 
Individuum entspricht. In den meisten Anwendungen wird die Rohfitnefi jedoch dahingehend modi- 
fiziert, dafi die Fitnefi werte zwischen Null und Bins liegen und grofiere Fitnefiwerte bessere Individuen 
widerspiegeln {adjusted fitness). Eine hohe Fitnefi bringt nun die Fahigkeit eines Individuums zum 
Ausdruck, vermehrt Nachkommen zu erzeugen. Dementsprechend wird zunachst fiir jedes Indi- 
viduum ce Pt proportional zu seiner Fitnefi die Wahrscheinlichkeit Pcr ermittelt, und auf der Basis 
der sich ergebenden Wahrscheinlichkeitsverteilung werden \Pr\ Individuen ausgewahlt {fitnefipropor- 
tionale Selektion), aus denen unter Zuhilfenahme der genetischen Operatoren Crossover \md Mutation 
\Pr\ Nachkommen erzeugt werden {Reproduktion hzw. Rekombination). Die Crossover-Operation hat 
die Aufgabe Individuen zu kreuzen und wird dadurch realisiert, dafi auf der Basis einer a priori fest- 
gelegten Crossover^ Wahrscheinlichkeit Pc Erbinformationen, d.h. analytische Teifetusdriicke zwischen 
den Individuen ausgetauscht werden [14]. Demgegeniiber hat der Mutationsoperator im wesentlichen 
die Aufgabe, ’’lokale Optima” wieder verlassen zu konnen. Dabei werden auf der Basis einer a priori 
festgelegten, sehr kleinen Mutationswahrscheinlichkeit Pm einzelne Gene in den Chromosomen der In- 
dividuen (Terminalsymbole) durch andere zufallig ausgewahlte Terminalsymbole (Gene) ersetzt. Die 
genetischen Operatoren Crossover und Mutation konnen somit als ’’Suchoperatoren” interpretiert 
werden, mit dem neue, wesentlich besser an die Umwelt angepafite Individuen gefunden werden. Die 
Leistungsfahigkeit der genetischen Programmierung als Suchverfahren basiert auf den Effekten der 
fitnefiproportionalen Selektion, Reproduktion und Rekombination und kommt dadurch zum Aus- 
druck, dafi sich die Losungsgiite mit iiberproportionaler Geschwindigkeit (gemessen an der Anzahl 
der Generationen) verbessert [12]. 

3. Die genetische Bestimmung der analytischen Approximationen 

3.1 Implementation der genetischen Programmierung und Parameterspezifikation 

In der vorliegenden Untersuchung basiert die Ermittlung der Rohfitnefi auf einem Sample von 1000 
Datensatzen, die als Wertetupel der Form (Po, 5o, X, r, T, a) zufallig generiert worden sind. So beze- 
ichnet den Preis des Basisobjekts (der Aktie) z\it = 0, X den Basispreis der Option, r den stetigen 
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risikolosen Zinsatz in % p.a., T die Restlaufzeit der Option in Jahren und a die Volatilitat des Ba- 
sisobjekts in % p.a. Pq wurde auf der Basis von 5o, X, r, T und a iiber die Finite DifFerenzen 
Methode (mit At = (l/365)/5 = 0,00054795, AS = 0,01 und 5max = 2 • So) berechnet und beze- 
ichnet den (numerisch) exakten Wert einer amerikanischen Aktienverkaufsoption zu t = 0. Bei der 
Zufallsinitialisierung der Wertetupel wurde eine Gleichverteilung herangezogen und aufgrund prak- 
tischer Gegebenheiten die folgenden Wertebereiche gewahlt: 3 <r<7, 0<T<1 und 5 < <7 < 50. 
Hinsichtlich der Zufallsinitialisierung von So und X wurde von der Beziehung a = So/X ausgegan- 
gen und der Wertebereich 0,9 < a < 1,1 herangezogen, weil empirische Untersuchungen belegen 
([3], [18]), dafi bei realen Transaktionen von Aktienoptionen a in bis zu 78 % der Falle in diesem 
Bereich liegt. Als Wertebereich fiir den Preis des Basisobjekts wurde bei der Zufallsinitialisierung 
10 < 5*0 < 100 gewahlt und iiber den Parameter a der entsprechende Basispreis X berechnet. 

Fiir die GP-Applikation wurde jedes Wertetupel in einen Eingabedatensatz Si (= (5o, X, r, T, g)) und 
in den korrespondierenden Soll-Ausgabedatensatz Af (= (Po)) transformiert (i = 1, . . . , 1000) und 
als Fitnefifunktion — Af)"^ verwendet. Die Terminalsymbolmenge beinhaltet die Variablen 

So, X, r, T, a und a, die numerischen Konstanten tt und e, die mathematischen Operatoren -h, — , 
♦, v^, ln(x), x^, die Verteilungsfunktion der univariaten Standardnormalverteilung ^(x), die 

logischen Operatoren <, <, =, >, > sowie die zur Fallunterscheidung notwendigen Terminalsymbole 
IF, THEN, ELSE. Die PopulationsgroBe wurde mit 50 Individuen fixiert, und fiir die Crossover- und 
Mutationswahrscheinlichkeit wurden die Werte = 0, 9 und pm = 0, 01 verwendet. 

3.2 Die genetisch bestimmten analytischen Approximationen 

In der Implement ierung der im vorhergehenden Abschnitt bzw. Unterabschnitt beschriebenen Form 
lieferte die genetische Programmierung nach Tmax = 20000 Generationen fiir das beste Individuum 
eine Rohfitnefi von 5, 1339 x 10“^. Die diesem Individuum entsprechende analytische Approximation 
zur Bewertung amerikanischer Aktienverkaufsoptionen ohne Dividenden wahrend der Restlaufzeit 
wird mit Pq^^ bezeichnet und lautet (fiir die einzelnen Parameter vgl. den Anhang): 

« p(i) ^ / Po - {if ^ (2) 

[ X — iSo sonst 

Fiir Pq^^ ist erkennbar, dafi sich der Wert einer amerikanischen Aktienverkaufsoption durch eine Ko- 
rrektur des Wertes der (entsprechenden) Europaischen Aktienverkaufsoption po ergibt. Im Vergleich 
zu den Ansatzen von Barone-Adesi und Whaley [5] sowie MacMillan [15] (BAwMM-Ansatz) 
unterscheidet sich Pq^^ im ’’Korrekturwert” und in der der Fallunterscheidung zugrundeliegenden 
logischen Bedingung. Daran ankniipfend haben numerische Analysen ergeben, dafi zwischen /3,X 
und dem dem BAwMM-Ansatz zugrundeliegenden numerisch ermittelten Killing Preis S* die ap- 
proximative Beziehung 

s*^s*^,,y = px (3) 

gilt, und es ist bemerkenswert, dafi die Beziehung (3) eine iiber die genetische Programmierung 
(implizit) gewonnene analytische Approximation zur Bestimmung des Killing Preises darstellt. 
Beriicksichtigt man in (2) den Zusammenhang ^ S*/X sowie die Beziehung a = 5o/X, so ergibt 
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sich die adaptierte genet ische Approximation (fiir die einzelnen Parameter vgl. den Anhang) 



P^(2) ^ I Po - 

[ A — 5o sonst, 



(4) 



und es ist zu erwahnen, dafi die Approximation bis auf die Parameter 5(-) und c mit der des 
BAwMM-Ansatzes iibereinstimmt. kommt durch die nachtrdgliche Verwendung traditioneller 
Bestandteile der approximativen Optionsbewertung zustande und kann daher als genetisch bestimmte^ 
ex post hybridisierte analytische Approximation zur Bestimmung des Wertes einer amerikanischen 
Aktienverkaufsoption betrachtet werden. Die (ex post) ermittelte Rohfitnefi fiir Pq^^ betragt 3, 2419 x 
10“^, und im Vergleich zu Pq^^ ergibt sich eine Verbesserung bei der approximativen Bestimmung 
des Wertes einer amerikanischen Aktienverkaufsoption. 

Aufgrund der erzielten Fitnefiverbesserung bei der ex post Hybridisierung wurde eine ex ante Hy- 
bridisierung der genetischen Bestimmung von analytischen Approximationen zur Bewertung von 
amerikanischen Aktienverkaufsoptionen dergestalt durchgefiihrt, dafi 5, nunmehr durch die Teiltupel 
(5o, 5*, A, r, T, a) reprasentiert wird, wobei S* den nach dem BAwMM-Ansatz numerisch ermittelten 
Killing Preis (c.p.) darstellt. Unter Beriicksichtigung dieser Modifikation wurde nach Tmax = 20000 
Generationen fiir das beste Individuum eine im Vergleich zu P^^ nochmals verbesserte Rohfitnefi 
von 2, 0043 x 10“^ festgestellt, und die diesem Individuum entsprechende analytische Approximation 
Pq^^ lautet (fiir die einzelnen Parameter vgl. den Anhang): 

p„«p(3)=|po-i;(i^r so>s- 

1 A — 5o sonst 



Ahnlich wie die genetisch bestimmten Approximationen Pq^^ und Pq^^ ist auch die mit der ex ante 
Hybridisierung ermittelte analytische Approximation Pq^^ strukturell mit dem BAwMM-Ansatz ver- 
gleichbar. 



4. Anwendungsergebnisse 

In diesem Abschnitt wird untersucht, inwieweit die genetisch bestimmten Approximationen zur Be- 
wertung amerikanischer Aktienverkaufsoptionen ohne Dividenden wahrend der Restlaufzeit der Op- 
tion gegeniiber anderen in der Literatur vorgestellten analytischen Ansatzen vorteilhaft sind. Zu 
diesem Zweck werden methodenseitig die in der Literatur haufig zitierten Approximationen Pqj von 
Johnson [13], Pqgj von Geske und Johnson [10], Pqbaw von Barone-Adesi und Whaley 
[5] bzw. MacMillan [15] und datenseitig einerseits die in Geske und Johnson [10] und die in 
Barone-Adesi und Whaley [5] enthaltenen Datenkonstellationen und andererseits ein umfan- 
greicher - insgesamt 20000 Optionen umfassender - und vom Trainingsdatensample unabhangiger 
Validierungsdatensatz herangezogen. Im Validierungsdatensatz werden fiir einige Parameter allerd- 
ings Wertebereiche verwendet, die grofier sind als jene, die den Testdatensatz zugrundeliegen. Die 
Vergrofierung dieser Wertebereiche im Validierungsdatensatz hat ini Hinblick auf die genetisch ermit- 
telten Naherungsformeln insbesondere zum Ziel, die Sensitivitat der Input parameter auf das bei Ver- 
wendung der genetisch bestimmten Naherungsformeln erzielte Approximationsergebnis aufzuzeigen 
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bzw. zu beriicksichtigen. Im Validierungsdatensatz wurden bei der Zufallsinitialisierung fiir die 
einzelnen Parameter die folgenden Wertebereiche gewahlt: 3 <r<8, 0<T<1 und 5 < cr < 50. 
Basis fiir die Zufallsinitialisierung von So und X bildete wieder die Beziehung a = So/ X, wobei als 
Wertebereich 0,8 < a < 1,2 verwendet worden ist. Der Wertebereich fiir den Preis des Basisob- 
jekts wurde mit 10 < 5 q < 100 festgelegt, und der entsprechende Basispreis X ergibt sich iiber den 
Parameter a. Mit diesem Validierungsdatensatz soil sichergestellt werden, dafi die abschlieBende 
Beurteilung der Approximationsgiite hohe Aussagekraft besitzt. Ferner wird auch dariiber berichtet, 
wie gut die genetisch bestimmte Naherungslosung S*^Q^y den numerisch ermittelten Killing Preis 
approximiert. 

In der Tabelle 1 (2) werden die genetisch ermittelten Naherungslosungen P^\ P^^^ und S*j.^^ 
sowie die in der Literatur haufig zitierten analytischen Approximationen Poj> Pqgj und Pobaw auf 
den Datensatz von Geske und Johnson [10] (Barone-Adesi und Whaley [5]) angewendet. Pq 
kennzeichnet den numerisch iiber die Finite Differenzen Methode berechneten ’’exakten” Option- 
spreis, S* den numerisch ermittelten ’’exakten” Killing Preis, und am Ende jeder Tabelle werden fiir 
jede Approximation die drei Abweichungsmafie MAE (mittlere absolute Abweichung), MSE (mittlere 
quadrierte Abweichung) und MAPE (mittlere absolute prozentuelle Abweichung) ausgewiesen. 



r 


T 


47 


X 


S * 


S * 


Po 


Pqj 


Pqbaw 


Pqgj 




p ( 2 ) 


n(S) 


0,0488 
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0,20 


35,00 
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MAE 










0,0553 




0,0692 
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0,0040 
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MSE ( Xl 0 “®) 








449,7680 




1351,8420 


15,4237 


4,4433 


0,7912 


1,1971 


0,3970 


MAPE 










0,0018 




0,0217 


0,0045 


0,0025 


0,0028 


0,0029 


0,0022 



Tabelle 1: Approximativ bestimmte Options- und Killing Preise (So = 40) 

Fiir beide Datensatze kann zunachst festgestellt werden (vgl. Tabellen 1 und 2), daB die 
Naherungslosung von Johnson im Vergleich zu den anderen Ansatzen die groBten Abweichungen 
vom numerisch exakten Optionspreis verursacht und damit die schlechtesten Approximationsergeb- 
nisse liefert. Die mittleren absoluten Abweichungen liegen bei der Approximation nach Johnson 
bei ca. 0,07 bzw. 0,7. Die nachstbesten Approximationsergebnisse werden mit der BawMm- 
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T 
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Tabelle 2: Approximativ bestimmte Options- und Killing Preise {X = 100) 



Approximation (MAE von ca. 0,01 bzw. 0,2) und der nach Geske und Johnson (MAE von ca. 0,004 
bzw. 0,02) realisiert, und die genetisch bestimmten Approximationen liefern die geringsten mittleren 
Abweichungen von den numerisch exakten Optionspreisen (MAE von ca. 0,002 bis 0,01). Dariiber 
hinaus ist fiir den approximativ ermittelten Killing Preis festzuhalten, dafi die grofiten Abweichungen 
vom numerisch ermittelten, exakten Killing Preis lediglich im Einstelligen-Nachkommabereich liegen 
(MAE von ca. 0,06 bzw. 0,28). 

Hinsichtlich einer allgemeinen Beurteilung der Approximationsgiite wurden fiir jeden der sechs Ap- 
proximationsansatze auf der Basis des insgesamt 20000 Optionen umfassenden Validierungsdaten- 
satzes die absoluten Abweichungen zwischen den approximativen und numerisch bestimmten, exak- 
ten Optionspreisen ermittelt und zu einer Abweichungsverteilung aggregiert. Diese sind im linken 
Teil der Abbildung 1 graphisch dargestellt. Der rechte Teil in der Abbildung 1 zeigt fiir den Va- 
lidierungsdatensatz die Abweichungsverteilung, die sich aus den absoluten Abweichungen zwischen 
den approximativ bestimmten Killing Preisen ^p^oxy und den numerisch ermittelten, exakten Killing 
Preisen S* ergibt. 

Aus der Abbildung 1 wird ersichtlich, dafi alle genetisch ermittelten Naherungsformeln zur Bewertung 
amerikanischer Aktienverkaufsoptionen ohne Dividenden wahrend der Restlaufzeit alle betrachteten 
Literaturansatze dominieren. Absolute Abweichungen vom numerisch ermittelten, exakten Option- 
spreis von weniger als 0,01 Geldeinheiten werden mit dem Poj“Ansatz mit einer Wahrscheinlichkeit 
von ca. 28 %, mit dem Poj?>iw“Ansatz mit ca. 58 % und mit dem PoGJ“Ansatz mit ca. 75 % erre- 
icht. Im Vergleich dazu kann mit den genetisch ermittelten Approximationen mit weitaus hoheren 
Wahrscheinlichkeiten damit gerechnet werden, dafi die Abweichungen weniger als 0,01 Geldeinheiten 
betragen: Mit dem Pg^^-Ansatz kann mit ca. 82 %-iger, mit dem Pg^^-Ansatz mit ca. 85 %-iger 
und mit dem Po^^-Ansatz mit ca. 92 %-iger Wahrscheinlichkeit erwartet werden, dafi die Abwe- 
ichungen weniger als 0,01 Geldeinheiten ausmachen. Die genetisch bestimmte Naherungsformel zur 
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Abbildung 1: Abweichungsverteilungen fiir approximativ bestimmte Options- und Killing Preise 

Ermittlung des Killing Preises liefert mit einer Wahrscheinlichkeit von nahezu 100 % Losungen, bei 
denen die Abweichungen zum numerisch exakten Killing Preis den Einstelligen-Nachkommabereich 
nicht iiberschreiten. Mit 90 %-iger (95 %-iger) Wahrscheinlichkeit kann konstatiert werden, dafi die 
absoluten Abweichungen zum numerisch ermittelten, exakten Killing Preis weniger als 0,23 (0,32) 
Geldeinheiten ausmachen. 



5. Zusammenfassung 

Im vorliegenden Beitrag wurden analytische Approximationen zur Bewertung amerikanischer Aktien- 
verkaufsoptionen ohne Dividenden wahrend der Restlaufzeit der Option ermittelt, und als in diesem 
Zusammenhang neues methodisches Werkzeug wurde die genetische Programmierung herangezogen. 
Anhand exemplarischer Datensatze und eines insgesamt 20000 Optionen umfassenden Validierungs- 
datensatzes konnte fiir alle genetisch ermittelten analytischen Naherungslosungen gezeigt werden, 
dafi sie die in der Literatur haufig zitierten approximativen analytischen Optionsbewertungsmodelle 
von Johnson, Geske und Johnson sowie Barone-Adesi und Whaley bzw. MacMillan hin- 
sichtlich der Approximationsgiite dominieren und zu signifikanten Verbesserungen bei der analytis- 
chen Bewertung amerikanischer Aktienverkaufsoptionen fiihren. Dariiber hinaus konnte in diesem 
Beitrag als Nebenprodukt ein analytischer Ausdruck gewonnen werden, der den numerisch ermittel- 
ten, exakten Killing Preis recht gut approximiert. Unter diesem Aspekt ist bemerkenswert, dafi eine 
der drei Approximationen zur Optionspreisermittlung einen rein analytischen Ansatz darstellt, weil 
auch der einfliefiende Killing Preis analytisch bestimmt wird. 

Anhang 

Die in und Pq^^ enthaltenen, in alphabetischer Reihenfolge angegebenen Parameter ergeben sich 
aus (die Berechnungsreihenfolge ist ly, u, u, A, bzw. A', c, D{Y) und B{Y)): 
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Die in enthaltenen, in alphabetischer Reihenfolge angegebenen Parameter ergeben sich aus (die 
Berechnungsreihenfolge ist Di, E, B\ c, und fiir A’ vgl. oben): 
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Summary: An important issue is the extent to which problem specific knowledge must be used 

in the construction of learning algorithms (in other words the power and quality of inferencing 
rules) capable to provide significant performance improvements. Very general methods having a 
wide range of applicability in general are weak with respect to their performance. Problem specific 
methods, like branch and bound, achieve a highly efficient learning but with little use in other problem 
domains. Local search strategies are falling somewhat in between these two extremes, where genetic 
algorithms or neural networks tend to belong to the former category while tabu search or simulated 
annealing etc. are counted as instances of the second category. What is required are techniques for 
rapidly finding high-quality solutions in large, complex search spaces and without any guarantee 
of optimality. When sufficient knowledge about such search spaces is available a priori, one can 
often exploit that knowledge (inference) in order to introduce problem specific search strategies that 
can quickly find solutions of higher quality. Without such a priori knowledge, or in cases where 
close to optimum solutions are indispensable, information about the problem has to be accumulated 
dynamically during the search process. Likewise, obtained long-term as well as short-term memorised 
knowledge constitutes one of the basic parts in order to control the search process and in order to 
avoid getting stuck in a locally optimal solution. Previous approaches dealing with combinatorially 
explosive search spaces about which little knowledge is available a priori are unable to learn how to 
escape a local optimum. 

In this survey we will consider two basic scheduling problems, the job shop (more general: disjunctive 
scheduling) and its generalisation: resource constrained project (cumulative) scheduling (which also 
covers problems of crew scheduling, time-tabling, etc.). We introduce solution strategies incorpo- 
rating problem specific knowledge, e.g. branching and neighbourhood definitions, and, recently even 
more important, capacity consistency tests in combination with propagation of constraints. Their 
combination leads to powerful procedures for solving intractable scheduling problems. 

1. Job Shop Scheduling 

1.1 Problem, Model and Complexity 

A job shop consists of a set of different machines (like lathes, milling machines, drills, etc.) that 
perform activities on jobs. Each job has a specified processing order through the machines, i.e. a job 
is composed of an ordered list of activities which are characterised by the machine required and the 
processing time on it. There are several constraints on jobs and machines: (i) There are no precedence 
constraints among activities of different jobs; (ii) activities cannot be interrupted (non-preemption) 
and each machine can handle only one job at a time; (iii) each job can be performed only on one 
machine at a time. While the machine sequence of the jobs is fixed, the problem is to find the job 
sequences on the machines which minimise the makespan, i.e. the maximum of the completion times 
of all activities. It is well known that the problem is ATP-hard (Lenstra and Rinnooy Kan 1979) 
and belongs to the most intractable problems considered. A huge amount of literature on machine 
scheduling, including scheduling of job shops, has been published within the last 35 years, among 
others the books by Conway et al. (1967), who wrote the first book on scheduling theory. A broader 
view on production and activity scheduling is provided in Pinedo (1995), Domschke et al. (1997), 
Blazewicz et al. (1996a), and Tanaev et al. (1994a, 1994b). A variety of scheduling rules for certain 
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types of job shops have been considered in an enormous number of publications, see the excellent 
surveys, also covering complexity issues and mathematical programming formulations, by Blazewicz 
et al. (1991, 1996b), Lawler et al. (1993), Blazewicz (1987). 

There are different problem formulations and we adopt the one presented by Adams et al. (1988). 
Let V = {0, . . . , n} denote the set of activities where 0 and n are the dummy activities “start” (the 
first activity of all jobs) and “end” (the last activity of all jobs), respectively. Let M denote the set 
of m machines and V be the set of ordered pairs of activities constrained by the precedence relations 
for each job. For each machine A;, the set Sk describes the set of all pairs of activities to be performed 
on machine A;, i. e. activities which cannot overlap (cf. (ii)). For each activity i, its processing time 
Pi is fixed, and the earliest possible start time of i is stj, a variable that has to be determined during 
the optimisation. Hence, the job shop scheduling problem (JSP) can be modelled as: 

min stn 



subject to 



sti + Pi < stj, 




(1) 


stj “1“ Pj ^ stij 


Vi,i € £k,^k e M, 


(2) 


sti > 0, 


Vi € V. 


(3) 



Restrictions (1) ensure that the processing sequence of activities in each job corresponds to the 
predetermined order. Constraints (2) demand that there is only one job on each machine at a time, 
and (3) assures completion of all jobs. Any feasible solution to the constraints (1), (2), and (3) is 
called a schedule. 

An illuminating problem representation is the disjunctive graph model. In the edge-weighted graph 
there is a vertex for each activity; additionally there exist two dummy activities, vertices 0 and n, 
representing the start and end of a schedule, respectively. For every two consecutive activities of the 
same job there is a directed arc; the start vertex 0 is considered to be the first activity of every job 
and the end vertex n is considered to be the last activity of every job. For each pair of activities 
{i,j} G Sk that require the same machine there are two arcs (i, j) and (j, i) with opposite directions. 
The activities i and j are said to define a disjunctive arc pair or a disjunctive edge. Thus, single 
arcs between activities represent the precedence constraints on the activities and opposite directed 
arcs between two activities represent the fact that each machine can handle at most one activity at 
the same time. Each arc (z, j) is labelled by a weight pi corresponding to the the processing time of 
activity i. All arcs from 0 have label 0. The JSP requires to find an order of the activities on each 
machine, i.e. to select one arc among all opposite directed arc pairs, such that the resulting graph 
is acyclic (i.e. there are no precedence conflicts between activities) and the length of the maximum 
weight path between the start and end vertex is minimal. Obviously, the length of a maximum 
weight or longest path in the graph connecting vertices 0 and i equals the earliest possible starting 
time sti of activity i\ the makespan of the schedule is equal to the length of the critical path, i.e. the 
weight of a longest path from start vertex 0 to end vertex n. Any arc (i, j) on a critical path is said 
to be critical; if i and j are activities from different jobs then {i,j) is called a disjunctive critical arc, 
otherwise it is a conjunctive critical arc. 
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The history of the JSP, starting more than 35 years ago, is also the history of a well known bench- 
mark problem consisting of 10 jobs and 10 machines and introduced by Fisher and Thompson in 
1963. This particular instance of a 10 job 10 machine problem opposed its solution for 25 years, 
leading to a competition among researchers for the most powerful solution procedure. Since then 
branch and bound procedures have received substantial attention from numerous researchers. For a 
comprehensive survey of the JSP and different solution strategies see Blazewicz et al. (1996b). 

1.2 Branch and Bound Algorithms 

Currently the job shop champions among the exact methods are, besides the branch and bound 
implementations of Applegate and Cook (1991), Martin and Shmoys (1996), and Perregaard and 
Clausen (1995), the branch and bound algorithms of Caseau and Laburthe (1995), Carlier and 
Pinson (1990, 1994), and Brucker et al. (1994, 1996a). The power of their methods basically results 
from some inference rules which describe simple cuts, and a branching scheme in which activities 
that belong to a block (a sequence of activities on a machine) on the longest path are moved to the 
block ends. 

Branching in the algorithm of Brucker et al. is restricted to moves of activities which belong to a 
critical path of a solution obtained by a heuristic based on dispatching rules. For a block B, i.e. 
successively processed activities on the same machine, that belongs to a critical path, new subtrees 
are generated by moving an activity to the very beginning or the very end of B. In both cases the 
critical path is modified and additional disjunctive arcs are selected. 

The idea of the branching of Martin and Shmoys (1996) is to use the tightness of time windows of 
possible activity start times as branching criterion. For tight or almost tight windows, where the 
window size equals (almost) the sum of the processing times of an activity set A, branching depends 
on which activity in A is processed first. When an activity is chosen to be first, the size of its window 
is reduced. The time window size of the other activities in A is updated in order to reflect the fact 
that they cannot start until the chosen activity is completed. The update of windows of activities 
on one machine causes further updates on all other machines. 

One of the main drawbacks of all branch and bound methods is the lack of strong lower bounds in 
order to cut branches of the enumeration tree as early as possible. Several types of lower bounds 
are applied in the literature, for instance, bounds based on Lagrangian relaxation (Van de Velde 
1991), bounds based on the optimal solution of a subproblem consisting of only two or three jobs 
and all machines (Akers 1956, Brucker and Jurisch 1993). However, the most prominent bounding 
procedure has been described by Carlier (1982) and Potts (1980). 

Consider any activity i in the job shop or in the disjunctive graph, which may already include a 
partial selection of arcs from disjunctive arc pairs. Then there is a longest path from the artifical 
vertex 0 to z of length esU as well as a longest path of length qi connecting the end of activity i to the 
dummy sink activity n. Activity i cannot be processed earlier than its release date esU (also called 
head) and its processing has to be finished until its due date - qi in order to not cause a schedule 
delay. The time qi is said to be the tail of activity i. There exist m one machine lower bounds for 
the optimal makespan of the job shop scheduling problem where each bound is obtained from the 
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exact solution of a one machine scheduling problem with release dates and due dates. Although this 
problem is A/'P-complete, Carlier’s algorithm quickly solves the one machine problem optimally for 
all problem sizes under consideration in the JSP. Balas et al. (1995) describe a branch and bound 
procedure that can yield improved one machine bounds. Their method additionally takes minimum 
delays between pairs of activities into account. 

Most branch and bound algorithms make use of additional inference rules — discussed in Section 2.1 
— in order to cut the enumeration tree during preprocessing or in the search phase. 



1.3 Approximation Algorithms 



Opportunistic Algorithms Nowadays, tailored approximation methods viewed as an opportunis- 
tic (greedy-type) problem solving process can yield optimal or near-optimal solutions even for prob- 
lem instances up to now considered as difficult. Here, opportunistic problem solving or opportunistic 
reasoning characterises a problem solving process where local decisions on which activities, jobs, or 
machines should be considered next are concentrated on the most promising aspects of the problem, 
e.g. job contention on a particular machine. Hence subproblems defining bottlenecks are extracted 
and solved separately and serve as a basis from which the search process can expand. Problem 
decomposition takes place until, eventually, sufficiently small subproblems are created for which 
effective exact or heuristic procedures are available. However, the way in which a problem is de- 
composed affects the quality of the solution reached. Not only the type of decomposition such as 
machine/resource (Adams et al. 1988), job/order (Pesch 1993, 1994), or event based (Sadeh 1991) 
has a dramatic influence onto the outcome, but also the number of subproblems and the order of their 
consideration. In fact, an opportunistic view suggests that the initial decomposition be reviewed in 
the course of problem solving to see if changes are necessary. The shifting bottleneck heuristic from 
Adams et al. (1988) and its improving modifications from Balas et al. (1995) and Dauzere-Peres 
and Lasserre (1993) are typical representatives of opportunistic reasoning. These resource based 
heuristics successively solve one machine problems to optimality and introduce the one machine so- 
lutions into the overall schedule. The quality of the schedules obtained by the shifting bottleneck 
procedure heavily depends on the sequence in which the one machine problems are solved, and se- 
quence changes may yield substantial improvements. This is the idea behind the second version of 
the shifting bottleneck procedure, known as the SB2-heuristic, as well as behind the second genetic 
algorithm approach by Dorndorf and Pesch (1995). 



Local Search In the recent years, local search based scheduling has become very popular; for a 
survey see Anderson et al. (1995), as well as Vaessens et al. (1996) and Aarts et al. (1994). These 
algorithms are all based on a certain neighbourhood structure and some rules that define how to 
obtain a new solution from an existing one. Initial approaches used the very simple neighbourhood 
structure Nl: 
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N1 A transition from a current solution to a new one is generated by replacing (in the disjunctive 
graph representation of the current solution) a disjunctive arc {i,j) on a critical path by its 
opposite arc {j,i). 

In other words, N1 means reversing the order in which two activities i and j are processed on a 
machine where these two activities belong to a longest path. This parallels the early branching 
structures of exact methods. It is possible to construct a finite sequence of transitions leading from 
a locally optimal solution to the global optimum. This is a necessary and sufficient condition for 
asymptotic convergence of simulated annealing. The neighbourhood N2 corresponds to the branching 
structure of the algorithm of Brucker et al. (1994, 1996a): 

N2 Consider a feasible solution and a critical arc {i,j) defining the processing order of activities 
i and j on the same machine, say machine m. Define p{i) and s{i) to be the predecessor and 
successor of z, respectively, on m. Restrict the choice of arc (z, j) to those vertices where at least 
one of the arcs {p{i),i) or {j^s{j)) is not on a longest path, i.e. z or j are block end vertices. 
Reverse (i,j) and, additionally, also reverse {p{h),h) and (k^s{k)) — provided they exist — 
where h directly precedes z in its job, and k is the immediate successor of j in its job. The 
latter arcs are reversed only if a reduction of the makespan can be achieved. 

Dell’Amico and Trubian (1993) considered the problem as being symmetric and scheduled activities 
bidirectionaly, i.e. from the beginning and from the end, in order to obtain a priority rule based 
feasible solution. The resulting two parts finally are put together in order to constitute a complete 
solution. The neighbourhood structure N3 employed in their tabu search algorithm extends the 
connected neighbourhood Nl: 

N3 Let (z, j) be a disjunctive critical arc. Consider all permutations of the three activities {p(z), z, j} 
and {z, j, 5(j)} in which (z,j) is inverted. 

Again, it is possible to construct a finite sequence of moves with respect to N3 which leads from 
any feasible solution to an optimal one. In a restricted version of N3, arc (z, j) is chosen such that 
either z or j is the end vertex of a block. In other words, arc {i,j) is not considered as candidate 
when both (p(z), z) and {j, s{j) are on a longest path in the current solution. However, the resulting 
neighbourhood is no longer connected. Another branching scheme can be considered to define a 
neighbourhood structure N4: 

N4 For all activities z in a block move z to the very beginning or to the very end of this block. 

Once more, N4 is connected, i.e. for each feasible solution it is possible to construct a finite sequence 
of moves, with respect to N4, leading to a globally optimal solution. 

The most efficient tabu search implementations are described by Nowicki and Smutnicki (1996) and 
Balas and Vazacopoulos (1998). The size of the neighbourhood Nl depends on the number of critical 
paths in a schedule and the number of activities on each critical path. Because it can be very large, 
Nowicki and Smutnicki consider a smaller neighbourhood N5 that restricts Nl (or N4) to reversals 
on the border of a block. Moreover, only a single, arbitrarily selected critical path is considered. 
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N5 A move is defined by the interchange of two successive activities i and j, where i or j is the first 
or last activity in a block that belongs to a critical path. In the first block only the last two 
activities and, symmetrically, in the last block of the critical path only the first two activities 
are swapped. 

The design of a classical tabu search algorithm is straightforward. The search usually stops when an 
optimal schedule is detected or the number of iterations without any improvement exceeds a certain 
limit. Nowicki and Smutnicki note that the essential disadvantage of this approach consists of loosing 
information about previous runs. Therefore they suggest to build up a list of the x best solutions and 
their associated tabu lists during the search. Whenever the classical tabu search has finished, they 
go to the most recent entry, i.e. the best schedule S frond this list of at most x solutions, and restart 
the classical tabu search. Whenever a new best solution is encountered, the list of best solutions is 
updated. This extended tabu search “with backtracking” continues until the list of best solutions is 
empty. 

The idea of Balas and Vazacopoulos’ (1998) guided local search procedure is based on reversing 
more than one disjunctive arc at a time. This leads to a considerably larger neighbourhood than in 
the previous cases. Moreover, neighbours are defined by interchanging a set of arcs of varying size, 
hence the search is of variable depth and supports search diversification in the solution space. The 
employed neighbourhood structure N6 is an extension of all previous neighbourhoods. Consider any 
feasible schedule S and any two activities i and j to be performed on the same machine, such that i 
and j are on the same critical path, say P(0, n), but not necessarily adjacent. Assume i is processed 
before j. Again, p(z), p{j), and s(i),s(j), are the immediate machine predecessors and machine 
successors of i and j in 5; additionally, let a{i),a{j)^ and 6(z),6(j), denote the job predecessors and 
job successors of activities i and j, respectively. Moreover, let ecti := esU and q{i) := pi + qi be 
the length of a longest path (including the processing time of z, pi) connecting 0 and z, or z and n. 
An interchange of z and j either is a move of z right after j (forward interchange) or a move of j right 
before z (backward interchange). We have seen that the schedule S cannot be improved by such an 
interchange if z and j are adjacent and none of them is the first or last activity of a block in P(0, n). 
In other words, in order to achieve an improvement either a(z) or h{j) must be contained in P(0, n). 
This statement can easily be generalised to the case where z is not an immediately preceding activity 
of j. Thus for an interchange on z and j to reduce the makespan, it is necessary that the critical 
path P(0, n) containing z and j also contains at least one of the activities a(z) or b{j). Hence, the 
number of ’’attractive” interchanges reduces drastically. However, the question remains, under which 
conditions an interchange on z and j is guaranteed not to create a cycle. It is easy to derive that 
a forward interchange on z and j yields a new schedule S' (obtained from 5) if there is no directed 
path from b{i) to j in 5. Similarly, a backward interchange on z and j will not create a cycle if there 
is no directed path from z to a{j) in S. Now, the neighbourhood structure N6 can be introduced. 

N6 A neighbour S' of a schedule S is obtained by an interchange of two activities z and j in a 
block on a critical path. Either activity j is the last one in the block and there is no directed 
path in S connecting the job successor of z to j, or, activity z is the first one in the block and 
there is no directed path in S connecting z to the job predecessor of j. 
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While the neighbourhood N1 involves the reversal of a single arc (i, j) on a critical path, the more 
general move defined by N6 causes the reversal of a potentially large number of arcs. In order to 
combine local search procedures operating on different neighbourhoods (which makes it more likely 
to escape local optima and explore regions not available by any single neighbourhood structure) 
Balas and Vazacopoulos combine their guided local search with the shifting bottleneck procedure. 
The resulting algorithm currently is the most powerful heuristic for the JSP. 



2. Consistency Tests for Disjunctive and Cumulative Scheduling 

The purpose of this section is to present a class of logical tests called interval capacity consistency 
tests which are based on resource constraints. These tests allow to reduce the set of possible activity 
start times by ruling out infeasible start time assignments. They can be applied in scheduling 
algorithms such as list scheduling or branch and bound procedures, or in constraint propagation 
based scheduling systems. The benefit of the tests is that they can reduce the search space and 
direct an algorithm towards good solutions. Here, we are only interested in the tests themselves and 
do not address scheduling algorithms in which they can be embedded. Since the tests only eliminate 
solutions incompatible with the capacity constraints, they are independent of the overall objective 
function to be optimised. The assumptions that we will make are rather general and cover AfV- 
hard models such as the JSP and the resource-constrained project scheduling problem with simple 
(RCPSP) or generalised precedence constraints (RCPSP/max), cf. Brucker et al. (1998). 

An activity i is characterised by its processing time p* and resource requirements riki it requires rik 
units of a renewable resource k for each of Pi time units, and it releases the resource units again upon 
completion. Once begun, an activity must be processed in an uninterrupted fashion. A resource k 
is available in constant amount Rk. By imposing an upper bound on the latest completion time and 
a lower bound on the earliest start of all activities, activity start time windows can be derived in a 
straightforward way. The start time window of activity i is the interval between the earliest start 
time esti and the latest start time Isti of and the domain 6i of activity i is the set of all possible 
start time assignments within [estijsti]. Activity domains can, of course, be approximated by start 
time windows, and this approximation is then referred to as activity release times and due dates, or 
heads and tails (cf. Section 1.). 

In the following we assume that all activity domains have been made consistent with the temporal 
(or precedence) constraints, e.g. through a CPM Istyle time window calculation. We are interested 
in further reducing the domains by applying interval consistency tests, some of which are known 
under the names immediate selection, edge finding, and energetic reasoning. This domain reduction 
process can be understood as an adjustment of heads and tails or time bound adjustment. 

Before continuing, we briefly introduce some additional notation. We denote the set of all activities as 
V and the subset of all activities to be processed by resource k as VkQV, with V/fe = {z G V\vik >0}. 
We will often consider subsets ^ C Vjb of activities. In analogy to the earliest and latest activity 
start times, we use ecU and Icti as the earliest and latest completion time of activity i. 
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Interval Work An activity i requires an amount of work wik := from resource k. A time 
interval is capacity consistent if the amount of work available from resource k within the interval 
is greater than the interval work required by all activities. Let us consider the work of an activity 
that must fall into a time interval [^ 1 ,^ 2 )- The smallest amount of time, or interval processing time^ 
during which an activity i must be executed in [^ 1 ,^ 2 ) is 



Pi{tut 2 ) = max {0, min {pi, t 2 - h, ecti - ti, ^2 - , (4) 

and the corresponding interval work is Wikih^ ^ 2 ) = ^ 2 )- The interval work for a set ^ C Vk is 

W {Aj ^ 1 ,^ 2 ) = h)^ Using this interval work definition we can state an interval consistency 

constraint in its general form: 

Rk * (^1 — ^2) > ^2)- ( 5 ) 

Clearly, a scheduling problem can only have a feasible solution if this constraint holds for all resources 
k and time intervals [^ 1 ,^ 2 )- However, this is only a necessary condition and does not guarantee that 
a feasible schedule exists. 

The notion of interval capacity consistency as defined here has to the best of our knowledge first 
been suggested by Lopez, cf. Lopez et al. (1992), under the name energetic reasoning. Schwindt 
(1998) has independently developed a concept of interval work. The area of the rectangles defined 
by activity processing time and resource requirements can be interpreted as work or energy, and we 
use the terms interchangeably. 

Constraint 5 can also serve to derive lower bounds for makespan minimisation problems in the 
following way: Impose a hypothetical upper bound UB on the makespan; if this leads to a violation of 
Constraint 5 then C/5 -h 1 is a lower bound. Due to the principle of refuting hypothetical constraints, 
this general approach has been called destructive improvement (Klein and Scholl 1997). Nuijten 
(1994), Pesch and Tetzlaff (1996), Heilmann and Schwindt (1997), or Schwindt (1998) have used 
destructive improvement to calculate lower bounds for the JSP, RCSP, and RCPSP/max based on 
Condition 5. Test values for UB are usually chosen through a dichotomising search. This type of 
bound argument can, equivalently, be applied whenever any of the tests described in the following 
sections causes an activity domain to become empty. 

2.1 Consistency Tests for Disjunctive Scheduling 

This section discusses tests based on Constraint 5 that can be applied in disjunctive scheduling, 
where all activities i £ A C Vk are mutually exclusive in the sense that they exclusively occupy 
resource k throughout their processing time. This is of course always the case if all activity sizes 
and resource capacities are equal to one, as in many machine scheduling problems such as the JSP. 
However, even if rik > 1 for some i € Vfc, and consequently Rk > I, the tests may still be used for 
subsets of “disjunctive” activities. For instance, it is likely that difficult RCPSP instances contain 
such disjunctive scheduling elements (Kolisch et al. 1995). Hence, disjunctive consistency tests have 
been suggested for the makespan minimisation RCPSP (Brucker et al. 1996b, Klein and Scholl 1997, 
Baptiste et al. 1998) and RCPSP/max (Schwindt 1998). 
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Input/output test Consider a set A of activities to be processed by the same resource. Carlier 
and Pinson (1989) have derived conditions under which it can be concluded that an operation i € A 
must be scheduled first or last in A. If i is scheduled before or after A \ {i} we may also think of i 
as the input or output of .A \ {i}, hence the name of the conditions. Using the shorthand notation 

= HieAPi for the total processing time of A, the tests can be described as follows: 

If maXueA\{t],veA,u:}(:v{lcty - estu) < P{A) then i must precede all activities in {z}. 

If max«e^,v€. 4 \{i},ri 7 tv(^otv “ < P{A) then i must succeed all operations in A.\ {i}. 

In branch and bound procedures that branch over disjunctive edges, the test may be employed 
to immediately select the orientation of edges, a process often called immediate selection, as first 
suggested by Carlier and Pison, or edge finding, a term introduced by Applegate and Cook (1991). 
The input/output test was first described by Carlier and Pinson in the context of a branch and 
bound algorithm for the JSP; the tests that they actually implemented in their initial algorithm 
were limited to two-element sets A and one additional heuristically determined A and i € >1 for 
each resource. Using these tests, they were for the first time able to optimally solve the 10 x 10 JSP 
instance mentioned in Section 1.1. 

Efficient algorithms that have later been developed for testing the input /output conditions for all 
A and i usually use an ordering of activities according to earliest start and latest completion times. 
The challenging part is to test the input/output conditions and calculate preemptive bounds at the 
same time. Carlier and Pinson (1990), Nuijten (1994), and Martin and Shmoys (1996) have designed 
0(|Vjfep) algorithms for testing all subsets A C Vk- The algorithm of Nuijten has the interesting 
property that it can be generalised for cumulative scheduling. 0(|Vjb| log |Vjb|) algorithms for testing 
all subsets have been described by Carlier and Pinson (1994) and by Brucker et al. (1996a). Caseau 
and Laburthe (1995) describe an algorithm based on the concept of “task intervals” for checking 
all sets A with effort 0(|Vib|^). The advantage of their approach is that the consistency conditions 
can be evaluated incrementally within a search procedure, so that the effective time complexity for 
performing the tests at each node of a search tree is usually lower than 0(|Vjfcp). 

Input-or-output test ^Prom the input/output condition it can be deduced that an operation 
i ^ AQVk must be scheduled first or last in A. The weaker input-or-output condition can be used 
to show that a precedence relation must exist between a pair of activities i and j from set A: 

If TnaXueA\{i},veA\{j},u:^v{^cty — esty) < P{A) then i must be scheduled first or j must be 

scheduled last in A. If i ^ j then i must precede j. 

A similar condition for the special case where i = j has been described by Carlier and Pinson 
(1990), see also Blazewicz et al. (1997, 1998). Stronger conditions based on considering all sets A 
of cardinality r, hence called r-set conditions, have been discussed by Brucker et al. (1996a). They 
describe an 0(|Vfcp) 3-set algorithm that checks all activity sets of cardinality three and detects 
all pairwise ordering relations derivable from triples. The algorithm thus implements the input-or- 
output test for 1 .4 1 = 3. Judging from the implementation within their branch and bound procedure 
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for the JSP, the efficiency of the 3-set tests is comparable to that of the input/output tests. The 
development of an algorithm with low polynomial time complexity for testing the input-or-output 
conditions is an open issue. 

Input/output negation test By farther relaxing the condition to be tested, we can still draw ad- 
ditional conclusions in situations where the input-or-output condition and the stronger input /output 
conditions do not hold. Consider an activity i € AQVk'- 

If maxve^\{i} (/ctv - est*) < P{A) then i must not precede *4 \ {z}. 

If — estu) < P{A) then i must not succeed *4\ {z}. 

The input /output negation test has first been suggested by Carlier and Pinson (1989). Most authors 
working on consistency tests have considered the test in some form. However, an algorithm that 
tests all interesting A and z with effort 0 (|Va;P) has only recently been developed by Baptiste and 
Le Pape (1996). Other researchers have often applied the tests in an incomplete way, testing only 
some A and z. Caseau and Laburthe (1995) have integrated the tests in their task interval algorithm 
which tests input /out put conditions and the negation conditions with effort 0(|V;fc|^). 

Bound-consistency The sequence consistency tests described above can be used to derive domain 
reductions. There exists a close relationship between these domain-oriented consistency tests and the 
concepts of consistency defined in the constraint satisfaction and constraint propagation literature 
(cf. Tsang 1993). Nuijten (1994) has shown that the aforementioned consistency tests establish 
2-bound-consistency, and Dorndorf et al. (1998b) have derived additional tests for 3-b-consistency. 

2.2 Consistency Tests for Cumulative Scheduling 

While disjunctive scheduling or sequencing is concerned with unit resource requirements and ca- 
pacities, cumulative scheduling considers the general case of arbitrary requirements and capacities. 
The disjunctive sequence consistency tests developed in the preceding section can be generalised for 
cumulative scheduling in a straightforward way by considering interval work instead of processing 
times. This relation was first pointed out by Nuijten (1994); see also Nuijten and Aarts (1996). 

In the disjunctive case it can be shown that it is sufficient to consider time intervals defined by 
the start and completion times of activities (activity or task intervals), and that there is nothing to 
be gained from using interval work or processing time instead of set based work on the right side 
(Dorndorf et al. 1998a). However, it turns out that this is not the case for cumulative scheduling. 
Consider the difference between available and required resource capacity for a given time interval, 
i.e. the difference between the left and right side of the interval consistency constraint 5, as defined 
by the following slack function: 

slack(V*, h, < 2 ) = Rk-{t 2 -ti)-W (V*, (i, t 2 ). (6) 

By studying the possible extrema of the slack function for a given V*, the set of intervals [< 1 ,^ 2 ) can 
be characterised for which the function can take a local or global minimum and may thus violate 




363 



an interval consistency constraint. Schwindt (1998) and Baptiste et al. (1998) have recently shown 
that the number of such minimum-slack intervals is of order of magnitude 0{\Vk\^) and have given a 
characterisation of the intervals. The set of minimum-slack intervals is larger than the set of activity 
intervals but still of order 0{\Vk\^). 

Baptiste et al. have developed an 0{\Vk\^) algorithm for testing the interval consistency constraint 
for all minimum-slack intervals. Schwindt has used the condition to calculate lower bounds for 
RCPSP/max instances. Tests that check the remaining slack for unit time intervals have been 
described under the name “time table based constraint propagation” (Le Pape 1994). Dorndorf et 
al. (1998c) have used this test in an algorithm for the RCPSP/max. 

3. Summary 

The interval consistency tests described have been applied frequently and with great success in branch 
and bound and local search algorithms for solving the JSP. Fewer and so far less conclusive results 
have been reported for the application of the tests for cumulative scheduling. Many of the tests are 
available in general purpose scheduling software libraries such as ILOG Scheduler (Le Pape 1994, 
Nuijten and Le Pape 1998), CHIP (Aggoun and Beldiceanu 1993), or CLAIRE Schedule (Le Pape 
and Baptiste 1996). 
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Summary: In this paper, we develop efficient heuristic algorithms for multiprocessor task schedul- 
ing in a two-stage flow-shop environment to minimize makespan. We also derive a very effective 
lower bound for the problem. Then, we analyze the worst case error-bound of the heuristic algo- 
rithms. Next, we perform computational experiments to analyze the performance of the heuristic 
algorithms by computing the average percentage deviation of each heuristic solution from the lower 
bound. The results of the computational experiment on a set of randomly generated problems show 
that the proposed heuristic algorithms perform well. Furthermore, we consider the reverse problem 
and show that the two problems are equivalent. Finally, we identify a special case of the problem 
that is polynomially solvable. 



1. Introduction 

1.1 Problem Definition 

Consider the following multiprocessor task scheduling problem: There is a set J oin independent and 
simultaneously available jobs to be processed in a two-stage flow-shop where stage 1 has two identical 
parallel processors and stage 2 has one processor. All the processors are continuously available from 
time 0 onwards and each processor can handle no more than one job at a time. The jobs are divided 
into two groups, namely G\ and G 2 , according to their processor requirements at the first stage. 
The jobs in group G\ can be processed without interruption by any one of the two processors at the 
first stage. The jobs in group G 2 must be processed simultaneously without interruption by both 
of the processors at the first stage. Hence we have multiprocessor tasks at the first stage for which 
the processor requirements and the processing times are denoted by sizen and p^i, respectively 
(z = 1,2, ...,n). All jobs, both in G\ and G 2 , should be processed without interruption on the 
processor at the second stage for a period of Pi 2 after finishing their processing at the first stage 
{i = 1,2, ...,n). Hence jobs flow through from stage 1 to stage 2 while satisfying the flow-shop 
and the multiprocessor task constraints. The objective is to find an optimal schedule for the jobs so 
as to minimize the maximum completion time of all jobs (makespan), Cmax- Using the well-known 
three-field notation with extensions introduced later (Drozdowski (1996), Hoogeveen et al. (1996)), 
we denote this scheduling problem as F2(P2, l)\sizen € {1, 2}|C'max- F2{P2, 1) denotes a two-stage 
flow-shop with two identical parallel processors at the first stage and one processor at the second 
stage, which is usually referred to as a hybrid flow-shop. Note that F2(P2,l)\sizen G {l,2}|C'max 
problem will reduce to the classical hybrid flow-shop problem if we have only G\ type jobs and to 
F2 1 1 Cmax problem if we have only G 2 type jobs. 
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1.2 Literature Review 

Multiprocessor task (MPT) scheduling and hybrid flow-shop scheduling (HFS) have traditionally 
been studied separately. In previous studies, HFS problems are based on the assumption that a task 
can be processed by at most one processor at a time. Various researchers develop different heuristic 
algorithms for HFS and analyze their either the average performances (Gupta (1988), Guinet et al. 
(1996), Gupta et al. (1997)) or the worst-case error bounds (Lee and Vairaktarakis (1994), Chen 
(1995)). Computational complexity results for several preemptive scheduling problems in a two-stage 
hybrid flow-shop are discussed by Hoogeveen et al. (1996). 

MPT scheduling problems are classifled according to whether the processor requirements are prespec- 
ified or not. MPT scheduling problems in the single-stage shops, where the processor requirements 
are prespecified, are addressed by Kubale (1987), Blazewicz et al. (1992), Hoogeveen et al. (1994), 
Bianco et al. (1995) and Kubale (1996), among others. Brucker and Kramer (1995, 1996) discuss 
this class of MPT scheduling problems in several multi-stage shops. The MPT scheduling problems 
without prespecified processor requirements are studied mostly in single-stage shops (see, for exam- 
ple, Lloyd (1981), Blazewicz et al. (1986), Du and Leung (1989)). All above papers mainly address 
the computational complexity issues of MPT scheduling problems. Most recently, several heuristic 
algorithms are developed for scheduling MPTs with non-prespecified processor requirements in a 
two-stage flow-shop with arbitrary number of identical parallel processors at each stage by Oguz et 
al. (1998). An extensive survey on scheduling MPTs is presented by Drozdowski (1996). 

2. Heuristic Algorithms and a Lower Bound 

Since F2{P2,l)\\Cmax is strongly NP-hard (Hoogeveen (1996)), F2{P2,l)\sizen € {l,2}|Cmax is 
also NP-hard in the strong sense. Thus, we concentrate on developing efficient heuristic algorithms 
to find approximate solutions to this problem. In developing the heuristic algorithms, we consider 
Johnson’s algorithm (Johnson, 1954), which solves the traditional two-stage flow shop problem, 
i^2||Cmax, optimally. 

2.1 Algorithm Description 

The heuristic algorithms proposed are based on well-known list scheduling associated with the sub- 
sequences obtained in a Johnson’s sequence. A job sequence S is said to be a Johnson’s sequence 
if it contains two subsequences. Si followed by S 2 where the jobs in S\ {S 2 ) satisfy the inequality 
Pii < Pi 2 (Pii > Pi 2 ) and are sequenced in the non-decreasing (non-increasing) order of pn {pi 2 ). By 
using this definition, we obtain a Johnson’s sequence with Gu and Gu {G 21 and G 22 ) for jobs in 
Gi (G 2 ) for F2(P2jl)jsizen € {l,2}|Cmax- We have the following four heuristic algorithms from 
different combinations of the subsequences obtained for F2(P2, l)\sizen e {l,2}lCmax: 

Heuristic 1 {HI): The job sequence is {Gu,G 2 i,Gi 2 ,G 22 }- 
Heuristic 2 (H2): The job sequence is {G 2 i,Gn,Gi 2 ,G 22 }- 
Heuristic 3 {H3): The job sequence is {Gn,G 2 i,G 22 ,Gi 2 }- 
Heuristic 4 (H4): The job sequence is {G 21 J Gu, G 22 ) G 12 }. 
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In all heuristic algorithms, given a job sequence, if the next unscheduled job in the sequence is in Gi, 
it is scheduled whenever any one of the two processors becomes free at the first stage. If the next 
unscheduled job is in G 2 , it is scheduled whenever both of the two processors become free at the first 
stage. At the second stage, the jobs are scheduled according to the first-in-first-out rule. In other 
words, the jobs are scheduled at their earliest possible starting time at the second stage as they are 
processed and finished at the first stage so that the fiow-shop constraints are not violated. 

2.2 The Lower Bound of 

It is easy to see that the following is a lower bound on the value of an optimal schedule for 
F2(P2,l)|s2zea€{l,2}|C„,ax: 

1 n 

LB = max{ m^{pii +Pi2}, ^ Pn + X) P<i + {Pii}}- 

- ^ ieGi ie02 - 1=1 - 

3. Worst-Case Error Bound of the Heuristic Algorithms 

In this section, the worst-case error bound of the heuristic algorithms proposed is derived. Let be 
the idle time at the second stage between job J* and its proceeding job, be the number of jobs 
in Gij, optimal makespan, and Cmax{Hl) be the makespan of schedule obtained from 

heuristic HI (/ = 1, 2, 3, 4; i = 1, 2, . . . , n; j = 1, 2). First, we have the following result. 

Lemma In any schedule tt obtained from heuristic algorithms, HI, H2, H3 and H4, Ii < pn for all 
jobs (2 = 1,2,..., n). 

Theorem 1 For F2(P2, l)|s 22 ;eii G {1, 2}|Cmax, Gra^{H\)/Cl,^ < 2. 

Proof Without loss of generality, we assume that the jobs are indexed as 1, 2, . . . , n in the schedule 
obtained from heuristic HI. Suppose that job Jk is the last job in the schedule that has an idle time 
at the second stage, i.e. h > 0, and /^ = 0 for 2 = A; -h 1, . . . , n. We need to consider three cases: 



1 . U k < rill + ^21 j then Jj € Gu U G21 and pn < pi2, \ <i <k. 



Considering the second stage, we have 



Cn.ax(i^l) 



n k 



^Pi2 + 5^4 



< 

< 

< 



n k 



5^Pi2 + 

1=1 i=l 



n k 



+ ^Pi2 
i=l i=l 






2 . If Till + ri2i < /u < nil + ^21 "b ^12) then J{ 6 G12 U G22 pn > pi2, k < i < n. 
Let Ski be the starting time of job Jk at the first stage. Then: 

nii+Ti2i n k—1 

Ski < Pil + 2 

i=l ^ i=nii+n2i+l 
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and the makespan will be 



— Ski Pkl Pk2 

i—k+l 

wii nii+ri 2 i 2 ” 

^ 5ZPtl + Ptl + X Pil +PA:1 +Pib2 + Pt2- 



i=l i=nii+l ^ »=T»ii+n2i+l 

Now consider the optimal makespan: 

^max ^ H + 9 

ieG 2 ^ ieGi 

1 «ii nii4-n2i 1 nii+n 2 i+ni 2 n 

= ^EPil+ E Pil + 9 E Pil+ E 



i-k+l 



9 X ■ X V X-li ' rt 

t=l t=nii+l nii+n 2 i+l nn+n 2 i+ni 2 +l 

nii+n 2 i 



Pil 

«ii+n 2 i+ni 2 



2 ^11 nm-n2i 2 A;— 1 . 

- 2 51 + 5V 5Z P*1 +PA:1 + ^ Pi2) + 2-^ 

1=1 t=mi+l nii+n2i+l A:+l nu+n2i+ni2+l 



Pi2- 



Therefore, 



2 ”11 2 . »iii+n 2 i+ni 2 

Cmax{H 1) - C^ax ^ o 5Z ^*1 ■*“ o [Pkl + 51 ^* 2 ) + Pk2 

^ i=l ^ i=k+l 

^ nil nii+n 2 i+ni 2 

- 2 ^ + Pfc2 + X^ Pi2) 

t=l i=k 

^ 2 + PA :2 + XZ ^*2) 



i€Gi 

< ^( 2 c;j = c;^. 

3 . If nil + ^21 + ni2 < k, then Jj € G22 and Pn > Pi2, i < k < n, and we have: 

k n 

Cmax{Hl) < Ptl + Pi2 

t=l i=k 

k n 

— XI Pil + xz ^*1 51 Pi^ y^Pi2- 

ieGi i€G2i i=nii+n2i+ni2+l i=k 

Now consider the optimal makespan: 

^max > 9 5 Z 51 Pi^ 



' ieGi 



i€G 2 



- 9 53 + 51 + 53 

»GGi *€G 2 i 



Pil 



Therefore, 



t=mi+n 2 i+ni 2 +l 
J /C""l 71 

- 2 ^ ^ ^ 53 ^^* 2 - 

i€Gi iEG2i i=nii+Ti2i+ni2+l A: 



< ^EPii+p*i 

^ ieGi 

< ^^ax. 



Corollary < 2, C,n^{H 3 )/C'^^ < 2 , and C„,ax(i? 4 )/C;„ < 2 . 

Proof Similar to that of Theorem 1 . 
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4. Experimental Design and Computational Results 

In this section, we present the design and the results of our computational study performed to test 
the performance of the heuristic algorithms proposed. In the computational experiments, processing 
times are randomly generated from a discrete uniform distribution as follows: If the jobs are in Gi, 
then pii € [1, lOOki] and Pi 2 G [1, 100] and if the jobs are in G 2 , then pn e [1, IOO/D 2 ] and Pi 2 € [1, 100] 
{i = 1, 2, . . . , n). Different values of ki and k 2 will determine the difficulty of the problems since they 
affect the load at each stage. We first fix ^2 = 1. Since there are two processors to process jobs in 
Gi at the first stage, we assume that the jobs in Gi have longer processing times at the first stage 
and let ki = 1, 2, 3. Then we fix /ci = 1 and let k 2 = 0.5, 0.8, 1, 2, 3, 4, 5. The problems are generated 
so that the number of jobs in each group, rii and n 2 , ranges from 30 to 100. For each combination 
of parameters, we generate 200 problems. 

In addition to the four heuristic algorithms proposed, we also consider a new heuristic algorithm, 
H, which takes the minimum of the makespan produced by heuristics ifl, if 2, H3, and if 4. The 
performance of these five heuristic algorithms is analyzed with respect to the average percentage 
deviation (APD) of each heuristic solution from the lower bound and their ability to achieve the 
lower bound. The APD of each heuristic solution from the lower bound is defined as ((Cmax(^0 ~ 
LB) /LB) X 100, where Cmax(Hl) denotes the Cmax obtained by heuristic HI {I = 1,2, 3, 4). The APD 
for each individual heuristic algorithm together with heuristic H is given in table 1. The number 
of problems (out of 200) for which the heuristic solution is equal to the lower bound is also given 
in table 1 for each heuristic algorithm. In the computational study, we observed that the problems 
have a similar behavior for different values of rii and ri 2 . Hence, we will only report the results of 
those problems where ni,n 2 € {50, 100} to avoid unnecessary data. 

When we compare the heuristic algorithms, excluding H, we observe from table 1 that heuristic H2 
performs better than others in most of the cases on average. Results listed in table 1 show that 
heuristics if 1 and H3 produce better results than heuristic if 2 only when the number of jobs in G\ 
is no less than the number of jobs in G 2 and the distributions of processing times are identical for 
G\ and G 2 jobs. Table 1 displays that heuristic fi4 does not produce better solutions than other 
heuristic algorithms. 

It is also apparent from table 1 that heuristic H performs very well and it is very effective in finding 
a good approximate solution. Table 1 confirms that heuristic H finds very near solutions to the lower 
bound for most of the problems. These results demonstrate that the proposed heuristic algorithms 
are effective in terms of the average performance. It should be noted that since LB is used in 
computing APD, the performance of heuristic algorithms may be even better. 

We can see that all heuristic algorithms have a tendency to become better when the number of jobs 
increases and they are more effective when the distributions of job processing times are similar. We 
also observe that heuristic if performs very good for small and large values of k 2 , but its performance 
deteriorates for the medium values of k 2 . These results suggest that the problems become easy when 
the loads at both stages are similar or one of the stages becomes dominant. 

Another observation we can derive from table 1 is that although the APD of heuristic if is within 
0.8% of the optimal solution, the number of problems for which Cmax(H) = LB is not high when the 
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distributions of processing times of jobs in G\ and G 2 are not identical and it gets worse when the 
difference between ki and k 2 is larger. This implies that the lower bound is not much effective for 
those cases and more effective lower bounds should be identified. 

The performance of individual heuristic algorithms ffl, H2, if 3, and if 4 is further studied on the 
basis of their contributions to heuristic H. In table 2, we report the number of problems for which 
each individual heuristic algorithm finds the minimum makespan used by heuristic if. It is clear 
from table 2 that heuristic if 2 yields the schedule with minimum makespan for most of the problems. 
This supports the observations we made from table 1. 

Finally, we note that since the time complexity of each heuristic algorithm is O(nlogn), all heuristic 
algorithms proposed are very efficient beside being very effective. 

5. The Reversed Problem and a Special Case 

5.1 The Reversed Problem 

In this section, we consider the reversed problem of F2{P2^l)\sizen G {l,2}|Cniax where the first 
stage has one processor and the second stage has two processors and show that the two problems 
are equivalent. Thus all the results of F2{P2,\)\sizen G {l,2}lCmax can be used for the reversed 
problem. 

Given a problem instance of F2{P2^\)\sizen G {l,2}|C'niax with Gi, G 2 , Pij and sizen {i = 
1,2, ...,n;j = 1,2), we can construct the problem instance of the reversed problem as follows: 
For each job J{ G Gi {G 2 ) in the instance of F2(P2, l)\sizen G {1, 2}|Cmaxj we will have a job J- in 
the instance of the reversed problem with = Pi 2 , p ^2 = P» 1 j and size ^2 = sizen {i = 1, 2, . . . , n) 
and this job will belong to group G^^ (^ 2 ). For a given problem instance of the reversed problem, 
we can obtain the problem instance of F2(P2, l)\sizen G {l,2}lCmax in the same way. Let Cma^{n) 
denote the makespan of schedule tt, and Cij denote the completion time of job Jj at stage j in tt 
( z = l,2,...,n, j = 1,2). 

Theorem 2 For any schedule tt of F2(P2,l)\sizen G {l,2}|Cmax, there is a schedule tt' of the 
reversed problem where CmaxM = Cniax(7r') and vice versa. 

Proof tt' can be constructed by setting the starting time of job J/ at the first stage and at the second 
stage as Cmax(7r) - C {2 and C'max(^) - C'»i, respectively (z = 1, 2, . . . , n). It is easy to see that tt' is 
a feasible schedule of the reversed problem and Cmax(^) = Cmaxi^r')- For a given schedule tt' of the 
reversed problem, we can obtain a schedule tt of F2(P2, l)\sizen G {1, 2}|C'max in the same way. ■ 

5.2 A Special Case 

Although F2(P2, l)\sizen G {l,2}|Cmax is NP-hard in the strong sense, we identify one case which 
can be solved in polynomial time. If maxi<i<n{pa} < niini<i<n{pi 2 }, then the second stage will be 
the dominant stage and there will be no idle time at the second stage except before the job scheduled 
in the first position. Hence, Cmax = Pk\ + E?=iPi 2 - This implies that the first job in an optimal 
schedule, say job Jk, should be selected so that pk\ = mini<i<n{p»i}. The rest of the schedule will 
be arbitrary. Thus, the computational complexity of this problem is 0(n). 
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6. Conclusions 

In this paper, we consider a two-stage flow-shop scheduling problem with multiprocessor tasks to 
minimize makespan. We develop four very efficient and effective heuristic algorithms in view of the 
strong NP-hardness of the problem. We also derive a lower bound for the problem and show that the 
worst-case error bound of each heuristic algorithm is 2. We also analyze the average performance of 
the heuristic algorithms. The results show that heuristic H2 performs better than the others. The 
results also indicate that heuristic H which considers the minimum of makespan obtained from four 
heuristic algorithms performs quite well and generates solutions that are within 0.8% of the optimal 
solution on average. 
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Ein working— forward Verfahren zur kostenorientierten 

Fliefibandabtaktung 

Matthias Amen 

lUC, Universitat Bern, Engehaldenstrasse 4, CH-3012 Bern 

Zusammenfassung: In diesem Beitrag wird ein neuartiges heuristisches Verfahren zur kostenorien- 
tierten Fliefibandabtaktung vorgestellt. Das „ Verfahren mit exakter Losung gleitender Problemaus- 
schnitte “ ist eine Anwendung der working-forward^^ Technik. Die Ergebnisse einer experimentellen 
Untersuchung zeigen, dafidie Ldsungsgiite signifikant besser ist als die aller bislang existierenden 
heuristischen Verfahren. Da der Rechenzeitbedarf handhabbar ist, kann der praktische Verfahrens- 
einsatz stark empfohlen werden. 



1 . Problemstellung 

Bei der Fliefihandfertigung durchlaufen die Produkteinheiten einer Produktart die aufeinanderfol- 
genden Stationen des Fliefibandes mit konst anter Transportgeschwindigkeit. Zur Bearbeitung eines 
Arbeitsganges steht in jeder Station des Fliefibandes die Taktzeit zur Verfiigung, nach der jede Pro- 
dukteinheit die bisherige Station verlafit und in der nachstfolgenden Station zur Bearbeitung zur 
Verfiigung steht. 

Die Gesamtbearbeitungsaufgabe besteht aus einer Menge okonomisch sinnvoll nicht weiter teilbarer 
Ablaufelemente, die als Verrichtungen bezeichnet werden. Gegenstand der Fliefibandabtaktung ist 
die zielorientierte Zerlegung der Menge der Verrichtungen in Teilmengen, die jeweils in einer zu 
bildenden Station auszufiihren sind. Dabei sind die Begrenzung der zeitlichen Stationsarbeitsinhalte 
je Produkteinheit durch die Taktzeit und die zwischen den Verrichtungen bestehenden technolo- 
gischen Reihenfolgebeziehungen (sowie die Nichtteilbarkeit der Verrichtungen) zu beachten. Diese 
Rahmenbedingungen lassen sich mit den folgenden Symbolvereinbarungen (Dimensionen: PE Pro- 
dukteinheiten, ZE Zeiteinheiten) 



I 


Anzahl der Verrichtungen [-] 


di Dauer der Verrichtung i [ZE/PE], 


h,ij 


Verrichtungsindex, 


i = 1(1)7 




/i, ij e {!,...,/} 




M 


Anzahl der Stationen [~] 


mi Station, der die Verrichtung i 


m 


Stationsindex, me {1, . . • , M} 


zugeordnet ist, z = 1(1)/ 


P 


Verrichtungsmenge (Arbeitsgang) in 


Vi Menge der technologisch unmittelbaren 




Station m, m = 1(1)M 


Vorgangerverrichtungen der 


C 


Taktzeit [ZE/PE] 


Verrichtung z, z = 1(1)/ 


wie folgt formal beschreiben: 
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Zerlegung der Gesamtmenge der Verricht ungen: 

M 



m=l 

n /^ = 0, Vm, m G {1, . . . , M} mit m^fh 



Einhaltung der Taktzeit: 

^ d^i <c^ m = 1(1)M 

*€/4i 

Einhaltung der Reihenfolgebeziehungen zwischen den Verricht ungen: 

V/i G Vi, z = 1(1)/ 



( 1 ) 

( 2 ) 

(3) 

(4) 



Bei der zeitorientierten Fliefibandabtaktung wird fiir eine gegebene Taktzeit c angestrebt, die Anzahl 
der Stationen M zu minimieren. Die kostenorientierte Fliefibandabtaktung (vgl. [9], S. 65-122; [8]; [1]; 
[2]) stellt eine Verallgemeinerung der zeitorientierten Fliefibandabtaktung dar: Es wird angestrebt, 
die von der Abtaktung abhangigen Lohnkosten und Kapitalkosten je Produkteinheit zu minimieren. 
Zur Formulierung der Zielfunktion werden weitere Symbole vereinbart (zusatzliche Dimension: GE 
Geldeinheiten): 



k^ Lohnkostensatz in der Station m 
[GE/ZE], m = 1(1)M 

kf Lohnkostensatz der Verrichtung i 
[GE/ZE], i = 1(1)J 

kl Kostensatz der Verrichtung i [GE/ZE], 
z = l(l)/ 



von der Fliefilinienlange abhangige 
Kapitalkosten fiir eine Station je 
Produkteinheit [GE/PE] 
k^ Kostensatz in der Station m [GE/ZE], 

771 = 1(1)M 

k relevante Stiickkosten [GE/PE] 



Die Kapitalkosten je Produkteinheit bestehen aus den Abschreibungen und den kalkulatorischen Zins- 
en fiir das Fliefiband und hangen bei identischen Stationslangen allein von der Anzahl der Stationen 
ab. Sie betragen M'k^^. Die Lohnkosten sind durch die Anzahl der Stationen M und die in den Sta- 
tionen jeweils giiltigen Lohnsatze /cjJ bestimmt. Dabei entspricht der in einer Station m zu vergiitende 
Lohnsatz k^ bei anforderungsorientierter Entlohnung dem maximalen Lohnsatz kf derjenigen Ver- 
richtungen z, die den Arbeitsgang dieser Station bilden: k^ = maxj/cj'^ \iel^},m = 
Unabhangig von dem Arbeitsinhalt in einer Station ist je Produkteinheit stets die Taktzeit zu ent- 

M 

lohnen. Die gesamten Lohnkosten je Produkteinheit betragen somit ^ ’ ^m- 

m=l 



M 

Die relevanten Stiickkosten k betragen somit k = ^ c • + M • Mit der Definitions- 

m=l 

gleichung der Stations A;Qsfe7zs atze k^ := k^ + y^/c konnen die Stiickkosten k auch durch k = 

M M / j^f3\ M 

C' k^-\- M • y^ = c • I A;^ H 1 = XI ^ beschrieben werden. Anhand der Definitions- 

m=l m=l V ^ / m=l 

gleichung der Verricht ungsifcosi^satze k\ := kf -\-y^ jc lassen sich die Stationskostensatze A;^ formal 
wie folgt ermitteln: 
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Stationskostensatze: 



= max{A;V|i € /^} , m = 1(1)M 



( 5 ) 



Die Zielfunktion des kostenorientierten Abtaktungsproblems kann damit wie folgt formuliert werden: 



Min 



k = 



M 



m=l 



( 6 ) 



Einzuhalten sind die Nebenbedingungen (1), (2), (3), (4) und (5). 

2. Ein working-forward Verfahren 

Fiir die betrachtete Problemstellung wurde in [1] erstmals ein allgemein formuliertes exaktes Ver- 
fahren vorgestellt. Da die kostenorientierte FlieBbandabtaktung zu den A/’‘P-schweren Problemen 
gehort (vgl. [12], S. 56; [8], S. 479), konnen exakte Verfahren nur bis zu einer gewissen Pro- 
blemgrofie innerhalb einer vert ret baren Rechenzeit optimale Losungen ermitteln. Heuristische Ver- 
fahren fiihren in einer vertret baren Rechenzeit zu einer guten, jedoch nicht notwendigerweise opti- 
malen Losung. Da mit Ausnahme des exakten Verfahrens samtliche bislang existierenden Verfahren 
zur kostenorientierten FlieBbandabtaktung das in der zeitorientierten FlieBbandabtaktung bekannte 
Optimierungskriterium der maximalen Stationsbelegung (vgl. [5], S. 266) verwenden, jedoch bekannt 
ist, daBkostengiinstige Abtaktungen verfehlt werden konnen, wenn die Stationen maximal belegt 
werden (vgl. [1]), liegt es nahe, eine heuristische Version des exakten Verfahrens zu entwickeln. In 
diesem Beitrag wird mit dem ^Verfahren mit exakter Losung gleitender Problemausschnitte^^ erst- 
mals ein heuristisches Verfahren zur kostenorientierten FlieBbandabtaktung vorgestellt, bei dem die 
Stationen nicht notwendigerweise maximal belegt werden. 

Das Verfahren kann als ''working-forward” Verfahren (vgl. [4], S. 315 f.) klassifiziert werden. Sukzes- 
sive werden handhabbare Teilprobleme, die als Problemausschnitte bezeichnet werden, aus dem zu 
losenden Abtaktungsproblem herausgegriffen und anhand des in [1] skizzierten exakten Verfahrens 
optimal gelost. Der Anfangswert fiir die obere Schranke der minimalen Kosten in einem Problem- 
ausschnitt wurde jeweils anhand der Prioritatsregel „giinstigste Veranderung der aktuellen Station- 
sleerkosten“ (vgl. [2]) ermittelt. Von der Losung eines Problemausschnitts werden die ersten Sta- 
tionen endgiiltig fixiert; die letzten Stationen werden wieder aufgehoben. Sodann wird der nachste 
Problemausschnitt aus den noch nicht endgiiltig zugeordneten Verrichtungen gebildet und optimal 
gelost. Das Verfahren ist beendet, sobald samtliche Verrichtungen endgiiltig einer Station zugeordnet 
wurden. Aufgrund der iiberlappenden Vorgehensweise werden Ineffizienzen, die an den Schnittstellen 
aufeinanderfolgender Problemausschnitte entstehen konnten, vermieden. Ahnliche Verfahren zur 
Losung anderer FlieBbandabtaktungsprobleme wurden in [10], [11] und [7] vorgestellt. Zur formalen 
Verfahrensbeschreibung werden weitere Symbole eingefiihrt: 
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7^^ Hochstanzahl der Verrichtungen eines Problemausschnittes [-] 
irest Menge der noch nicht zugeordneten Verrichtungen 
peti Menge der Verrichtungen eines Problemausschnittes 
^kum Anzahl der bislang endgiiltig festgelegten Stationen [-] 

Mteti Anzahl der in einer Leistungsabstimmung eines Problemausschnittes gebildeten 
Stationen [-] 

Anzahl der in einer Leistungsabstimmung eines Problemausschnittes gebildeten 
Stationen, die endgiiltig festgelegt werden [-] 

AnteilM Anteil an den in einer Leistungsabstimmung eines Problemausschnittes gebildeten 
Stationen, der maximal endgiiltig festgelegt wird [-] 

Voraussetzung fiir die in der Darstellung 1 algorithmisch beschriebene Vorgehensweise ist eine topo- 
logische Verrichtungsnumerierung ^ d.h. fiir alle Vorgangerverrichtungen h € Vi einer Verrichtung i 
mufi h < i gelten. Ist diese Voraussetzung nicht erfiillt, miissen die Verrichtungen zuvor umnumeriert 
werden. 



Algorithmus: Exakte Losung gleitender Problemausschnitte 


1 


Mkum 0, /*«< /’•“* := {1, . . . , /}. 

(Voraussetzung: topologische Numerierung) 


2 


Anwendung eines exakten Verfahrens zur Leistungsabstimmung auf 7*®*^ 


Ergebnis: Stationen mit den Stationsbelegungen m = -h -f 


3 


Von den Stationen werden die ersten := max jl, j^AnteilM • Stationen mit 

den dazugehorigen Stationsbelegungen endgiiltig eingerichtet. Diese bilden die Stationen 
^kum ^ i^kum Stationsbelegungen m = -f 1(1)M*^“^ 4- 


4 


jrest ._ jrest _ 


5 


^kum ._ ^kum Af/*®, 


6 


Gilt > 7^^, dann bestimme 7*®*^ (die 7^^ ersten Verrichtungen aus 

(Voraussetzung: topologische Numerierung)) und gehe zu 2. 


7 


Anwendung eines exakten Verfahrens zur Leistungsabstimmung auf 7^®*^ 


Ergebnis: Stationen mit den Stationsbelegungen 7^, m = -h 1(1)M*^^”^ -h M*®*^ 

M \= -f M*®*^ Das Verfahren ist beendet. 



Darst. 1: Algorithmus: Exakte Losung gleitender Problemausschnitte 



Das Verfahren arbeitet mit den beiden Parametern 7^^ und AnteilM. Ein Problemausschnitt besteht 
aus den 7^^ Verrichtungen kleinster Numerierung aus der Menge der noch nicht zugeordneten 
Verrichtungen 7^®^^ Gilt < |7^^|, dann ist der aktuelle Problemausschnitt der letzte. Ist 

|7re5t| ^ |7^^|, dann besitzt der letzte Problemausschnitt nur Verrichtungen. Aufgrund der 
topologischen Numerierung ist die Einhaltung der zwischen den Verrichtungen bestehenden Rei- 
henfolgebedingungen bei der Bestimmung der Problemausschnitte gewahrleistet. Mit Ausnahme 
des letzten Problemausschnitts werden von den fiir einen Problemausschnitt ermittelten Sta- 
tionen die ersten Stationen endgiiltig festgelegt. Die Anzahl wird dabei durch := 
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max |l, I^AnteilM • } berechnet. Sie ergibt sich grundsatzlich unter Beriicksichtigung der Ganz- 

zahligkeit ([aj grofite natiirliche Zahl, die a nicht iibersteigt) aus dem mit AnteilM bezeichneten 
Anteil an den in einem Problemausschnitt gebildeten Stationen. Damit das Verfahren auch 
bei ungiinstigen Parameter- und Datenkonstellationen nicht in eine Endlosschleife gerat, wird je- 
doch von jeder Teillosung mindestens eine Station fixiert. Die Verricht ungen, die den 
letzten Stationen einer Teillosung zugeordnet wurden, werden als nicht zugeordnet deklariert. Im 
nachsten Problemausschnitt werden in der Reihenfolge aufsteigender Numerierung die nachsten 
noch nicht zugeordneten Verrichtungen beriicksichtigt und die nachsten Stationen endgiiltig 
gebildet. Diese Vorgehensweise wird solange wiederholt, bis samtliche Verrichtungen endgiiltig einer 
Station zugeordnet wurden. In Darstellung 2 ist der Verfahrensablauf skizziert: 



Uisung des ersten 
Problemausschnitt es 



I I 

FLxierung 

Y 



Lbsung des zwdten 
Pioblemaussc hnittes 




Lcgendc 

I I Stationen der Ldsung des aktuellen Problemausschnittes 
I endgUltig fixierte Stationen 

I 50 % der StationcnanzahL der Losung des aktuelLen 
Problemausschnittes (AnteilM = 0,5) 



P'iji^igP 2 ng Ldsung des dntten 

Problemausschnittes 

▼ I 




Darst. 2: Sukzessive L5sung gleitender Problemausschnitte 



In der Darstellung 2 erfordert die Losung des ersten Problemausschnitts M^®*^ := 5 Stationen 
(m = Bei AnteilM = 0,5 sind (bis zu) 50% der in einer Losung gebildeten Statio- 

nen endgiiltig zu fixieren. Aufgrund der Ganzzahligkeitserfordernis sind dies im Beispiel := 
max |l, 1^ AnteilM • M^®*^J} = max{l, [0.5 • 5J} = 2 Stationen (m = 1,2). Die in der Losung 
vorgesehenen Verrichtungszuordnungen zu den Stationen 3 bis 5 werden wieder aufgehoben. An- 
schliefiend wird der nachste Problemausschnitt definiert und optimal abgetaktet. In der optimalen 
Losung werden nun M*®*^ := 4 Stationen (m = 3, . . . , 6) benotigt. Hiervon werden := 

max {l, [AnteilM • = max{l, [0.5 • 4J} = 2 Stationen (m = 3,4) endgiiltig fixiert. Die 

Verrichtungszuordnungen zu den Stationen 5 und 6 wiederum aufgehoben. 



3. Ergebnisse einer experiment ellen Untersuchung 

3.1 Design des Rechenexperimentes 

In einem Rechenexperiment wurde das im vorangegangenen Abschnitt vorgestellte Verfahren hin- 
sichtlich seiner Losungsgiite und seines Rechenzeitbedarfs untersucht. Zur Generierung von Test- 
problemen wurden die folgenden Prohlemparameter herangezogen. Dabei gibt die Vorrangstrenge 
das Verhaltnis der vorhandenen zu der maximal moglichen Anzahl der mittelbar und unmittelbar 
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zwischen den Verrichtungen bestehenden Reihenfolgebeziehungen an 


(vgl. [6], 5 


5. 736). 




• Verrichtungsanzahl I [-]: 


50 


75 


100 


• Vorrangstrenge VS [-]: 


0,7 


0,8 


0,9 


• Taktzeit zu maximaler Verrichtungsdauer [-]: 


2 


3 




• maximale zu minimaler Verrichtungsdauer 


5 


10 




• maximaler zu minimalem Verrichtungskostensatz j_j 


1,25 


1,50 


2,00 



Anhand einer systematischen Variation der Problemparameter auf den angegebenen Stufen wurden 
in 108 Testfeldern (2^ x 3^ Versuchsdesign) zufallig und unabhangig voneinander jeweils 50 Probleme 
generiert. Insgesamt standen somit 5.400 Test probleme zur Verfiigung. 

Das ,,Verfahren mit exakter Losung gleitender Problemausschnitte“ wurde mit verschiedenen Kombi- 
nationen der Verfahrensparameter untersucht. Die Hochstanzahl der Verrichtungen eines Problem- 
ausschnittes variierte auf zwei Stufen (20 und 30), der Anteil an den in einer Leistungsab- 
stimmung eines Problemausschnittes gebildeten Stationen, der maximal endgiiltig festgelegt wird, 
AnteilM, auf drei Stufen (50 %, 70 % und 90 %). Insgesamt standen somit sechs Verfahrensversionen 
zur Verfiigung. 

3.2 Losungsgiite 

Die Beurteilung der Losungsgiite des Verfahrens ist Teil einer umfangreichen Untersuchung. Als 
Beurteilungsmafistab wurde die jeweilige prozentuale Abweichung von der besten der von 22 einge- 
setzten Verfahren ermittelten Losungen herangezogen (im folgenden als Losungsabweichung bezeich- 
net). Bei den eingesetzten Verfahren handelt es sich neben den sechs Versionen des hier vorgestellten 
Verfahrens um das mit einer Rechenzeitbeschrankung von 3.600 Sekunden je Testproblem versehene 
exakte Verfahren zum Auffinden einer Losung (vgl. [1]), um ein Zufallsauswahlverfahren sowie um 
eine Reihe von Prioritatsregelverfahren (vgl. [2] mit weiteren Nachweisen, [9], S. 105 - 110; [8], S. 
481 - 484; [3], S. 110 - 124]). Die jeweils beste bekannte Losung wurde als Beurteilungsgrundlage 
wurde gewahlt, da das Kostenminimum nicht fiir samtliche Testprobleme ermittelt werden konnte, 
und da beobachtet wurde, daBeine alternativ heranziehbare untere Schranke fiir das Kostenmini- 
mum systematisch vOn diesem abweicht. Der letztlich verwendete Vergleichswert stellt nachweislich 
in 86,48 % der Falle das Kostenminimum dar. In der Darstellung 3 wurden die Versionen des Ver- 
fahrens mit exakter Losung gleitender Problemausschnitte mit F-vXXsYY bezeichnet, wobei die 
Platzhalter XX und YY fiir die Verfahrensparameter und AnteilM (in %) stehen. Zum Vergleich 
ist unter der Bezeichnung E- 1-3600 auch das mit einer Laufzeitbegrenzung versehene (und daher 
unvollstandige) exakte Verfahren aufgefiihrt. Die weiteren in der Darstellung aufgefiihrten heuri- 
stischen Verfahren stellen jeweils die hinsichtlich der Losungsgiite besten Verfahren innerhalb einer 
Klasse vergleichbarer Verfahren dar. Z-1000 stellt ein Zufallsauswahlverfahren dar, bei dem die beste 
von 1.000 zufalligen Abtaktungen als Losung gespeichert wird. Mit H-KSA wurde das „Kostensatz- 
Angleichungs- Verfahren" (“Wage-Rate-Smoothing”) bezeichnet, das innerhalb der Klasse der Ver- 
fahren, die mehrere Prioritatsregeln einsetzen, tendenziell die giinstigsten Abtaktungen generiert (vgl. 
[8]). P-MinLK ist die Prioritatsregel “giinstigste Veranderung der aktuellen Stationsleerkosten", die 
zu signifikant besseren Losungen fiihrt als die iibrigen einfachen Prioritatsregelverfahren (vgl. [2]). 
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Rang 


Verfahren 


Verfahrcns- 


Abweichimg von der besten 


Anteil 


Signifikanzniveau 






parameter 


gefundenen Losung 


beste 








(nur bei 


Mittel- 


Standard- 


Mini- 


Maxi- 


Losung 


aSesam. 0,00449 






gleitenden 


wert 


abwei- 


mum 


mum 




^einzc. qqooi 






Problem- 

ausschnitten) 




chung 








ZiQ.: 3,89059 


[-] 




iPA 


AnteilM 


[%] 


[%] 


[%] 


[%] 


[%] 


1 kein signifikanter 
1 Unterschied 


1 


E-1-3600 


0,0465 


0,3172 


0,0 


4,9 


96,96 




2 


F-v30s50 


30 


0,5 


2,8394 


2,3424 


0,0 


14,6 


16,85 


, 1 


3 


F-v30s70 


30 


0,7 


3,0029 


2,4385 


0,0 


21,4 


17,15 




4 


F-v30s90 


30 


0,9 


3,0251 


2,3212 


0,0 


14,1 


15,22 




5 


F-v20s50 


20 


0,5 


4,3742 


2,7335 


0,0 


19,1 


6,72 




6 


F-v20s70 


20 


0,7 


4,7128 


2,7346 


0,0 


19,1 


5,35 




7 


F-v20s90 


20 


0,9 


5,4423 


3,0171 


0,0 


21,4 


4,52 




8 


Z-1000 






6,8867 


3,3094 


0,0 


17,7 


2,04 




9 


H-KSA 






7,7618 


3,4812 


0,0 


24,7 


0,80 




10 


P-MinLK 






9,6731 


3,8708 


0,0 


27,6 


0,33 





Darst. 3: Vergleich der L5sungsgiite 



Das Verfahren F-v30s50 lieferte mit einer mittleren Losungsabweichung von 2,8394 % die giinstigsten 
Ergebnisse der heuristischen Verfahren. Mit einer maximalen Losungsabweichung von lediglich 14,6 % 
kann diese Verfahrensversion als vergleichsweise ausreifierunanfallig angesehen werden. Fiir 16,85 % 
aller Testprobleme konnte F-v30s50 die beste von alien eingesetzten Verfahren berechnete Losung 
ermitteln. 

Die Erwartungswerte der Losungsabweichung wurden anhand eines multiplen z-Tests (z standardnor- 
malverteilte Priifgrofie) miteinander verglichen. Mit = 0,0001 wurde das Signifikanzniveau 

fiir einen einzelnen Paarvergleich derart gewahlt, dass ein aufgrund des Tests vermuteter Unter- 
schied zwischen den Erwartungswerten mit hoher Wahrscheinlichkeit auch realiter besteht. Der Er- 
wartungswertevergleich zeigt, dafi das Verfahren mit exakter Losung gleitender Problemausschnitte 
die iibrigen heuristischen Verfahren deutlich signifikant dominieren. Dabei liefern die Versionen mit 
maximal 30 Verrichtungen je Problemausschnitt = 30) signifikant bessere Ergebnisse als diejeni- 
gen mit = 20 Verrichtungen. Bei den Paarvergleichen der Verfahren F-v30s50 und F-v30s70 sowie 
der Verfahren F-v30s70 und F-v30s90 wurden keine signifikanten Unterschiede festgestellt. Bei den 
Versionen mit = 20 Verrichtungen verbessert sich die mittlere Losungsabweichung signifikant, 
wenn von einer Stufe des Parameters AnteilM zur nachstniedrigeren gewechselt wird. Mit einer 
steigenden Anzahl von Verrichtungen je Problemausschnitt wird in jedem Problemausschnitt die 
Struktur des Gesamtproblems besser beriicksichtigt. Durch einen geringen Anteil zu fixierender Sta- 
tionen (AnteilM) werden die hinteren Stationsbelegungen einer Teillosung revidiert, wenn sie sich in 
der nachsten Teillosung als ungiinstig erweisen. Die Losungsgiite steigt demzufolge mit dem Umfang 
der Revisionsmdglichkeiten. Die folgende Darstellung 4 zeigt die Losungsgiite der Verfahrensvari- 
anten in Abhangigkeit von den vorhandenen Problemparametern. Die in einem Teil vergleich je- 
weils nicht aufgefiihrten Problemparameter nehmen dabei samtliche untersuchten Auspragungen an. 
Rangplatzvertauschungen gegeniiber dem Totalvergleich sind grau unterlegt. 
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Abweicbuitg von der besten gefuodenen Ldsung | 


Verfahren 


Mitiel- 

wert 


Rang 


Mittd- 

wert 


Rang 


MitteU 

wert 


Rang 


Minelwert 


Rang 


Mittelwert 


Rang 




[%] 


fd 


{%} 


Id 


|%1 


Id 


[%] 




[%] 


td 






Verrichtungsanzahl 1 




VerriditungsdauerverhiLltnis 




50 


75 


100 


5 




10 




F v30s50 


2,4576 


3 


3,2489 


2 


2,8116 


2 


2,7231 


2 


2,9557 


2 


F v30s70 


2,6333 


4 


3.^75 


4 


2,8877 


3 


2,8430 


3 




4 


F-v30s90 


2,4068 


2 


3.4268 


3 


3,2418 


4 


2,9201 


4 




3 


F-v20s50 


3,9418 


5 


4,7044 


5 


4.4764 


5 


4,1239 


5 


4,6245 


5 


F v20s70 


4,2703 


6 


5,0665 


6 


4.8017 


6 


4,5009 


6 


4,9248 


6 


1 F^v20s90 


4,4936 


7 


6,1541 


7 


5.6792 


7 


5,2683 


7 


5,6163 


7 






Vorrang^tretigc VS 




Verfaaitnis von Taktzeit zu maxinuder 
Verrichtung^uer 




0.7 _ . 


0.8 


W 


2 




3 




F-v30s50 


2,9396 


2 


3,0648 


2 


2.5138 


2 


2.3799 


2 


3,2989 


2 


F-v30s70 


3,1627 


3 


32505 


4 


£5s» 


4 


2,4735 


3 


3,5322 


3 


F-v30s90 


3,2879 


4 


3,2109 


3 


ZS7fiS 


3 


2.4744 


4 


3,5759 


4 


F-v20s50 


4.5656 


5 


4,5939 


5 


3,9630 


5 


3,7896 


5 


4,9588 


5 


F-v20s70 


S.00S8 


6 


4,9757 


6 


4,1540 


6 i 


4,1465 


6 


5,2792 


6 


F-v20s90 


5.9589 


7 


5,6932 


7 


4,6748 


7 1 


4,6285 


7 


6,2561 


7 




VerHchtu ng^kostensatzvcrh^tnis 

l^v,nax^l^v,iniii 


To 

Mittelwert 


tal 

Rang 






1.25 


L50 


2.00 


[%] 




Id 




F-v30s50 


3,2428 


2 


3,0539 


2 


2,2214 


2 


2,8394 


2 




F v30s70 


3,3300 


3 


3,2096 


4 


2,4689 


4 


3,0029 


3 




F-v30s90 


3,4719 


4 


3.1470 


3 


2^4564 


3 


3,0251 




4 




F-v20s50 


4,3045 


5 


4,6395 


5 


4,1786 


5 


4,3742 


5 




F-v20s70 


4,6359 


6 


4,9707 


6 


4,5319 


6 


4,7128 


6 




F^v20s90 


5.4741 


3 


5.6147 


7 


5,2381 


7 


5,4423 


7 





Darst. 4: Losungsgiite der Verfahrensvarianten in Abhangigkeit von den Problemparametem 

Bei den Rangplatzvertauschungen wurden auf dem Signifikanzniveau von = 0,0001 keine 

signifikanten Unterschiede festgestellt. Bei i 25 fiihrte das Zufallsauswahlverfahren 

Z-1000 zu einer geringeren Losungsabweichung als F-v20s90, der Unterschied erwies sich ebenfalls auf 
dem Niveau von a®**'*®^ = 0, 0001 als nicht signifikant. Insgesamt kann somit die in dem Totalvergleich 
ermittelte Rangordnung der Verfahren als invariant gegeniiber Anderungen der Problemcharakter- 
istika angesehen werden. Rangplatzvertauschungen zwischen den Verfahren Z-1000, H-KSA und 
P-MinLK fanden nicht statt. 



3.3 Rechenzeiten 

Die gemessenen Rechenzeiten hangen von der Implementierungsart und der eingesetzten Hardware 
ab. Samtliche eingesetzte Verfahren wurden unter BORLAND PASCAL 7.0 programmiert und im 
protected mode implement iert. Verrichtungsmengen (z. B. Stationsbelegungen, Vorgangermengen) 
wurden mit Hilfe des Datentyps “Set” abgespeichert. Dies ermoglichte rechenzeitsparende Bitope- 
rationen. Die gemessenen Rechenzeiten wurden auf einen Intel-Pentium Rechner mit 133 MHz Takt- 
frequenz normiert. 

Eine Analyse des Verfahrensablaufs ergab eine Worst- Case- Rechenzeitfunktion von b •0{W •/) 
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bei bereits topologisch numerierten Verrichtungen und h -0{P) bei noch nicht topologisch nu- 
merierten Verrichtungen. Dabei bezeichnet b die exponentiell von der Grofie des Verfahrenspa- 
rameters abhangige Konstante fiir die Worst-Case-Rechenzeit zu Losung eines einzelnen Proble- 
mausschnitts und W die Anzahl der unterschiedlichen Lohngruppen bzw. Kostensatze der Verrich- 
tungen. Letztere korrespondiert unmittelbar mit dem Verrichtungskostensatzverhaltnis 
(Eine hohere Lohndifferenzierung geht auch in der Praxis mit einer Vergrofierung der Lohngruppenan- 
zahl einher.). Die bei unterschiedlichen Verfahrensparameter AnteilM) und unterschiedlichen 
Problemparametern (/, bzw. W) gemessenen Rechenzeiten sind der Darstellung 5 zu 

entnehmen. 



Anteil 


Kostensatz- 


Kostensatz- 


Durchschnittliche Rechenzeit bei einer Hochstanzahl der 


zu fixierender 


verhaitnis 


gruppen- 


Verrichtungen eines Problemausschnittes I^^ von / 


Stationen 




anzahl 




bei einer Verrichtungsanzahl I von 












I*’'^=20 










AnteilM 




W 


1=50 


1=75 


1=100 


1=50 


1=75 


1=100 


[-1 


[-1 


H 


[si 


[s] 


[s] 


[s] 


[s] 


[s] 




1,25 


2 


0,40 


1,25 


2,39 


1,19 


4,86 


10,98 


0,5 


1,50 


3 


0,74 


2,67 


5,90 


2,85 


15,58 


42,86 




2,00 


5 


1,25 


4,97 


12,05 


5,42 


35,69 


128,04 




1,25 


2 


0,32 


0,97 


1,97 


1,02 


3,65 


7,56 


0,7 


1,50 


3 


0,61 


2,17 


4,83 


2,59 


10,85 


32,75 




2,00 


5 


1,02 


4,03 


10,00 


5,02 


27,45 


98,29 




1,25 


2 


0,24 


0,70 


1,34 


0,71 


2,51 


6,14 


0,9 


1,50 


3 


0,45 


1,68 


3,45 


1,81 


8,16 


21,68 




2,00 


5 


0,77 


2,89 


7,37 


3,31 


21,01 


63,80 



Darst. 5: Rechenzeiten 



Die Verrichtungen der Testprobleme sind topologisch numeriert. Der in der (9-Notation angegebene 
Worst-Case-Rechenzeit verlauf spiegelt sich in den gemessenen Rechenzeiten nur teilweise wider. 
Dies liegt daran, dafidas zur Losung eines Problemausschnitts eingesetzte exakte Verfahren eine 
Vielzahl bedingter Verfahrensablaufe vorsieht, die dem Worst-Case- Verfahrensablauf faktisch nicht 
entsprechen. Die grundsatzliche Entwicklung der Rechenzeit bei einer "Anderung der Problem- 
charakteristika ist jedoch erkennbar: Fiir die praktische Anwendung eines Verfahrens mit exakter 
Losung gleitender Problemausschnitte darf daher der Wert nicht zu grofigewahlt werden, um 
in angemessener Rechenzeit eine Leistungsabstimmung zu erhalten. Eine Verringerung des Anteils 
endgiiltig zu fixierender Stationen AnteilM fiihrt tendenziell zu einem leicht abschatzbaren und nicht 
gravierenden Anstieg der Rechenzeit. 



4. Fazit 

In einem Vergleich mit den bislang existierenden heuristischen Verfahren zur kostenorientierten 
Fliefibandabtaktung hat sich das „ Verfahren mit exakter Losung gleitender Problemausschnitte “ 
hinsichtlich der Losungsgiite als stark signifikant iiberlegen erwiesen - dies unabhangig von den 
Auspragungen der Problemcharakteristika. Bei dem Einsatz des Verfahrens zur Losung grofierer Ab- 
taktungsprobleme in der Praxis ist ein Tradeoff zu losen: Einerseits verbessert sich die Lbsungsgiite 
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mit zunehmender Anzahl von Verrichtungen je Problemausschnitt, andererseits verschlechtert sich 
die Rechenzeit. Da eine Fliefibandabtaktung iiber einen langeren Zeitraum unverandert ist, soil- 
ten mehrere Abtaktungen mit dem vorgestellten Verfahren durchgefiihrt werden, wobei schrittweise 
die Verrichtungsanzahl je Problemausschnitt heraufgesetzt wird, und letztlich die beste gefundene 
Losung realisiert wird. Der Anteil endgiiltig zu fixierender Stationen hingegen sollte von vornherein 
moglichst gering gehalten werden. Gegeniiber einem hohen Anteil fiihrt dies tendenziell zu einem 
vertretbaren Anstieg der Rechenzeit. Schliefilich sei darauf hingewiesen, dafi das ,, Verfahren mit ex- 
akter Losung gleitender Problemausschnitte“ aufgrund der leichten Ubertragbarkeit auch zur Losung 
ahnlicher Reihenfolgeprobleme mit Vorrangbeziehungen angewendet werden kann. 

Literatur 

[1] Amen, Matthias: Ein exaktes Verfahren zur kostenorientierten Fliefibandabstimmung, in: Uwe 
Zimmermann u.a. (Hrsg.): Operations Research Proceedings 1996, Berlin u. a.: Springer, 

1997, S. 224-229. 

[2] Amen, Matthias: Prioritatsregeln zur kostenorientierten Leistungsabstimmung von Fliefilinien, 
in: Peter Kischka u.a. (Hrsg.): Operations Research Proceedings 1997, Berlin u. a.: Springer, 

1998, S. 456-461. 

[3] Heizmann, Jochem: Soziotechnologische Ablaufplanung verketteter Fertigungsnester zur Erho- 
hung der Flexibilitat von Montage-Fliefilinien, Karlsruhe: Jochem Heizmann Verlag, 1981. 

[4] Imboden, Carlo/Leibundgut, Andres/Siegenthaler, Peter: Klassifikation heuristischer Prinzi- 
pien: Ein methodologischer Beitrag zur Entwicklung von heuristischen Verfahren, in: Die 
Unternehmung, 32. Jg. (1978), S. 295-330. 

[5] Jackson, James R.: A computing procedure for a line balancing problem, in: Management 
Science, Vol. 2 (1956), S. 261-271. 

[6] Mastor, Anthony A[than]: An experimental investigation and comparative evaluation of pro- 
duction line balancing techniques, in: Management Science, Vol. 16 (1970), S. 728-746. 

[7] Reeve, N. Richard/Thomas, Warren H.: Balancing stochastic assembly lines, in: AIIE [Amer- 
ican Institute of Industrial Engineers] Transactions, Vol. 5 (1973), S. 223-229. 

[8] Rosenberg, Otto/Ziegler, Hans: A comparison of heuristic algorithms for cost-oriented assem- 
bly line balancing, in: Zeitschrift fiir Operations Research, Band 36 (1992), S. 477-495. 

[9] Steffen, Reiner: Produktionsplanung bei Fliefibandfertigung, Wiesbaden: Gabler, 1977. 

[10] Sturm, L. B. J. M.: An Improved Method for Balancing Assembly Lines, working paper R 
70/1, Interfaculty for Graduate Studies in Management, [Rotterdam, Niederlande], 1970. 

[11] Thomas, Warren H. /Reeve, N. Richard: Balancing continuous stochastic assembly lines: in: 
American Institute of Industrial Engineers (Hrsg.): Technical Papers, 23rd Annual Conference 
and Convention of the American Institute of Industrial Engineers, Anaheim, California, 1972, 
S. 409-419. 

[12] Wee, T. S. /Magazine, M[ichael] J.: Assembly line balancing as generalized bin packing, in: 
Operations Research Letters, Vol. 1 (1982), S. 56-58. 




Section 7: 



Energy & Ecology 

Section-Chair: A. Haurie, A. Kalliauer 




Allocation Problem for Ecological Planning 

A. Natalia I. Kalashnikova, Academy of Banking 
B. Alexander V. Boiko, Insurance Company “Megapolis” 

1. Introduction 

After the Chernobyl catastrophe, the ecological situation in Ukraine is subject to the serious control. 
In the Sumy region (the North-East of Ukraine), a lot of chemical industry enterprises are allocated. 
Many of them are extremely in need of a thorough reconstruction and/or modernization, in order to 
diminish their influence upon the environment with gases, liquids, solid wastes, etc. 

As an example, we consider the well-known Sumy Chemical Production plant (SCP), formerly one 
of the biggest European producers of mineral (especially phosphate) fertilizers. Since recently, it has 
been included into the TACIS Project 9601 “Energy Saving and Efficiency in the Chemical Industry” . 
The principal targets of the Project are: improvement of efficiency of the power consumption by 
various production units of the SCP, application of certain know-how techniques dealing with the 
power supply efficiency, as well as attracting investments to the SCP. 

In the frames of the above-mentioned project, ecological management has been carried out with 
aims of listing the SCP’s ecological problems, and elaborating a programme to minimize potential 
punishment of the enterprise for the ecological damage. This ecological programme included two 
stages: 

• ecological audit; 

• ecological training. 

The ecological audit was accomplished by Dr. Alan Fowler and Dr. John Clifford, the Senior 
Consultants in Ecology at the March Consulting Group, England, since January 20 until February 
5, 1998. Due to the shortage of time, the main auditing activity was focused at the following 
departments (shops) of the SCP: 

• Titanium Dioxide Shop; 

• Sulphuric Acid Shop # 5; 

• Phosphoric Acid Shop; 

• Ammophos Shop; 

• Defluorinated Phosphates Shop. 

According to the Ukrainian Legislation on Ecology, the SCP pays annually for the allowed amount 
of pollution. For instance, the sums payed by it in 1998 are listed below: 
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• Air Pollution 317,091 UHR^; 

• Liquid Pollution 35,317 UHR; 

• Solid Waste 417,277 UHR. 

However the audit has shown that each of the shops exceeds the feasible ecological damage limits 
by some items. For example, during the calcination (roasting) process, the Titanium Dioxide Shop 
violates the allowed amount of the Sulphur Dioxide emission to the great extent (550 versus 79.8 
tons per annum). The same problem is painful for the Sulphuric Acid Shop # 5. At the Ammophos 
Shop, the Fluorine pollution rate of 130 mg/m^ far exceeds the limit value 50 mg/m^. 

At last, the following limit concentrations in the liquid pollution are also violated: 



Item 


Limit value 


Real value 




(mg/litre) 


(mg/litre) 


Fluorides 


0.326 


3.36 


Sulphates 


100 


868 



When the pollution by a single shop exceeds the allowed limits, the Head of the Shop may be fined 
and/or prosecuted. If an enterprise’s total pollution violates the normal values, the former is fined 
by the sum ten times more than the regular fee. Strong violations of the pollution legislation can 
lead to closing the guilty shop or even the whole enterprise. 

In order to avoid these consequences, an enterprise needs investments to prevent the extra pollu- 
tion effluents caused by aged equipments and/or by accidents. To attract investors, the enterprise 
managers must have at their disposal a well-founded mathematical model that provides for a stout 
background of the amounts of capital intended for various units. Such a model is constructed in 
Section 2 of the paper. Section 3 is dedicated to description of an algorithm to solve the problem in 
question. At last, the results of numerical tests of the algorithm are presented in Section 4. 

2. Problem Specification 

On the one hand, municipal and regional authorities have enough power to fine the enterprises for 
various kinds of pollution. On the other hand, they can invest certain (very bounded) amounts of 
the capital into the modernization of some plants. However, the serious problem consists in that if 
one invests an amount of capital less than a certain basic threshold, the result is almost negligible in 
the sense of avoiding the pollution. Therefore, the investor need solving the following mixed integer 
programming problem. 

For the simplicity purpose, from now onward we restrict ourself to a single item of pollution. Consider 
n > 1 enterprizes numbered by i = 1, 2, . . . , n, with the probabilities 0 < pj < 1 of pollution outburst 
at each of them during a year or month term. Assume that in the case of the emergency effluent 
occuring, the i-th enterprize is fined by the sum fi > 0. Hence, the expectation of its financial loss 
is no less than pi/j. Futher, suppose that investment of the amount of capital no less than 6^ > 0 



1 UHR « 1 DM 
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reduces the probability of an emergency pollution effluent to qi < Pi. In practice, the salvage in fines 
fi(Pi - Qi) exceeds the needed invested funds hi significantly, i.e. 6* < fi{pi — qi). At last, assume 
that the investor (say, the regional authorities) has no more than E > 0 at his disposal for the term 
in question; moreover, an inequality holds 

( 1 ) 

t=l 

Taking all the above facts into account, we specify our problem as follows. Introduce binary variables 
Xi € {0,1}, i = 1,2, ...,n, and assume that the regional authorities invest a sum pi > bi into 
reconstruction of the i-th enterprise if and only if x* = 1, z = 1, 2, . . . , n. Then the following mixed 
integer programming problem arises: 

PI. find vectors x = (xi, 2 : 2 , . . • , Xn) and y = ( 2 / 1 , 2 / 2 , • • • , Vn) such that 

n n 

Z! + Z + Pi(l - a^i)] fi min, (2) 

i=l i=l 



subject to the constraints: 

2/i > 6t, i = l,2, (3) 

n 

Y.Xiyi<B, (4) 

i=l 

a;iG{0,l}, i = 1,2, ...,n. (5) 

It is easy to verify that (at least one) solution to problem (2)-(5) always exists. If the values > 0 
are constants, then it is easy to see that optimal values of pi are equal exactly to the respective bi. 
Therefore, (2)-(5) reduces to the “0-1” knapsack problem [3]: 

n 

maximize ^ djXj (6) 

i=i 

subject to 

'ZbjXj<B, (7) 

i=i 

Xj-e{0, 1}, i = ' (8) 

where 

dj = fjiPj - Ij) - > 0 (9) 

and bj > 0 for all j, the variables are ranked by a “prime indexing” so that 



d\> /b\> ^ 0^2' /^2' ^ ) 



where T, 2', . • • constitute a permutation of the numbers 1,2,... 

Then one can apply the “Greedy Algorithm” (cf. [3]) and solve the problem easily by taking xy as 
large as possible without violating the. constraint Y^bjXj < B (or any bound imposed on xy, as in 
the 0-1 problem), then taking X 2 > as large as possible subject to the value already assigned xy, and 



so on. 
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Example 1. Consider enterprises i = 1,2, 3, 4 that need investments of amounts y* > hi to reduce 
the probablities of accidents from pj down to qi\ an accident occuring at enterprise i results in the 
latter being fined by the sum /». Values of the above introduced parameters (6* and in thousand 
dollar) are presented in the table below. 



i 


bi 


Pi 


Qi 


fi 


1 


25 


0.3 


0.1 


200 


2 


60 


0.4 


0.1 


300 


3 


42 


0.1 


0.05 


1000 


4 


15 


0.2 


0.05 


400 



Assuming that investors have only B = \\Q thousand dollars at their disposal, determine the optimal 
investment plan. 



Solution. First compute by (9) the values d\ = 15, 6,2 = 30, dz = 8 , ^4 = 45. After comparing the 

ratios di/hi = 0.6, ^ 2/^2 = 0.5, 

dz/hz = 0.19, ^ 4/64 = 3, we clearly have 

1 ' = 4 , 2' = 1, 3' = 2, 4' = 3, 

and the optimal solution for our problem is 

X\ = 1, X 2 = 1, Xz = 0, x^ = 1. 

In the other words, the investors deliver their money for the reconstruction of enterprises 1,2, and 
4, by sums of 25, 60, and 15 thousand dollars, respectively; hence the total investment equals 100 
thousand dollars. 



3. Branch-and-Bound Algorithm to Solve Nonlinear Problem 



More complicated is the case when the probability parameters qi are not constants but functions 
of invested sums pj. More precisely, for each i = 1,2, ... ,n, we assume qi = qi{yi) to be a scalar 
non-decreasing function defined over the non-negative half axis and satisfying qi( 0 ) = Pi. 

Now the investor has to solve the following problem: 

n 

P2. min z = + f^Vi)] > (10) 

i=l 



subject to 



Xibi<yi<XiBj i = l,2, ...,n, 

t=l 

Xi € { 0 , 1 }, 2 = 1 , 2 ,. .., 71 , 



( 11 ) 

( 12 ) 

(13) 



which is clearly tantamount to problem (2)-(5) when the function 9<(j/i) is step-wise: 




if Vi < bi; 
if Vi > bi, 
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for all i. 

Also, additional constraints may be added to P2 to restrict the configuration of investments. Config- 
uration constraints, for example, might be included to prevent the investment location at enterprise 
j if an investment is located at, say, enterprise k. 

P2 is a mixed integer nonlinear programming problem (MINP) whose integer variables are restricted 
to the values 0 and 1. This problem is very close to the capacitated plant location problem described 
in [3]. Therefore following [3], we construct a kind of the branch-and-bound procedure to solve 

(10) -(13) 

Procedures proposed for solving P2 can be classified as “enumeration methods” to include “relaxation 
algorithms” and “partitioning algorithms” . Both algorithmic approaches are characterized by a tree 
diagram representing a systematic search for the optimal binary variables, Xj. 

For P2 each node of the search tree corresponds uniquely to a partition of the indices of the binary 
variables: 

(i) Kq = {j : Xj is set equal to 0}; 

(11) Ki = {j : Xj is set equal to 1}; 

(iii) K 2 = {j : Xj is not assigned a value }. 

Any xjt for A: € is called a free variable. There are, at most, two branches emanating from any 
node, u, in the tree. The two successor nodes correspond to choosing a free variable, at u, and 
forming two new partitions by alternately requiring that k e Kq and k e Ki, that is referred to as 
branching on Xk- 

It will be convenient to let RP denote P2 with the integer restrictions relaxed. Thus, RP is the 
nonlinear programming (NP) problem formed by removing the binary restriction on Xj in P2. If all 
the free variables in the NP optimal solution are integer, then a feasible solution has been obtained 
to P2; otherwise, some non-integer Xk is chosen for branching. 

At node 1 / in the search tree, RP can be written as 

RP" min 2 = JZ + fMVi)] + Pifi, (14) 



i^KiLiK2 i^Ko 

subject to 

bi<Vi<B, ie Ki, (15) 

Xibi <Vi< XiB, i € K 2 , (16) 

E Vi<B, (17) 

i^K\UK2 

0 < Xj < 1, ie K 2 , (18) 

xeX, (19) 



where x = (xi,X 2 , . . . ,3J„), and X is the set of feasible combinations of invested enterprises defined 
by any configuration constraints. 

When an exhaustive search among the nodes of the tree is undesirable or unreasonable (which is 
often the case when n > 30-40), one can use the following standard branch-and-bound procedure to 
solve P2. 




394 



3.1. Description of Algorithm 

Step 1. Set i(To = = 0, = {Ij 2, . . . , n} and solve RP defined by (14)-(19). Go to Step 2. 

Step 2. If all the optimal Xj are integer, then P2 is solved. Otherwise, choose arbitrarily a variable 
Xk with a non-integer optimal value and go to Step 3. 

Step 3. Solve two RPs with alternately updated indices sets: for RP*^' set Kq := AoU{A;}, K[ := 

A' := K 2 \ {A:}; for RP*'" set A" := Ko, := Ki U {A;}, A" := K 2 \ {A;}. Go to Step 4. 

Step 4. Compare the optimal objective values z' and z" of RP*'" and RP*'", resp. If z' < z” then set 
V := Ki := £=1,2, 3; otherwise set u := i/", Ki := Ki, £ = 1,2, 3. Go to Step 5. 

Step 5. If either K 2 = ^ or all the optimal values Xk, k € K 2 , of the solution of RP*' are integer, 
then take the latter for an approximate to a solution of P2. Otherwise choose arbitrarily a variable 
Xk, k e K 2 , with a non-integer optimal value and go to Step 3. 

Finiteness of the procedure is assured since there are only a finite number of free variables at each 
step. 

4. Numerical Tests 



The algorithm described in the previous subsection has been applied to test problems of various 
dimensions and with functions qi{y) of different forms. Namely, the following functions have been 
tested: 

{ Pu 
ft (2 - 
0 , 

( Pi, 

ft(2 - y/hif, 

0 , 

here 0 < g; < Pi. 

For instance, all three types of the fuctions qi{y) were applied to the test problem of Example 1 in 
Section 2. Computation results for functions defined by (20), (21), and (22) are presented below in 
Tables 1,2, and 3, respectively. 



Table 1 



i 


Xi 


Vi 


Xi 


Vi 


1 


1 


25.0354 


0 


0.0000 


2 


1 


60.0850 


1 


60.0850 


3 


0 


0.0000 


0 


0.0000 


4 


1 


15.0212 


1 


15.0212 




J3 = 140 


Eft = 100.142 


B = 99 


Eft = 75.106 




^min ~ 


= 270.042 


^min 


= 285.035 



if 2/ < h, 

if bi<y < 2bi, 
if 26j < y, 
if 2/ < bi, 
if bi<y < 2bi, 
if 26< < y, 
if 2/ < bi, 
if bi < y; 



(20) 



( 21 ) 

( 22 ) 
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Table 2 



i 


Xi 


Vi 


Xi 


Vi 


1 


1 


32.2520 


1 


29.0168 


2 


1 


77.4051 


0 


0.0000 


3 


0 


0.0000 


1 


48.7482 


4 


1 


19.3513 


1 


17.4102 




B = 140 


EPi = 129.009 


B = 99 


= 95.175 




^min ~ 


: 264.287 


^min 


= 278.578 



Table 3 



i 


Xi 


Vi 


— 

Xi 


Vi 


1 


0 


0.0000 


1 


29.7348 


2 


1 


69.8594 


0 


0.0000 


3 


1 


48.8963 


1 


49.8962 


4 


1 


17.4693 


1 


17.8215 




B = 140 


T,Vi = 136.230 


B = 99 


El/i = 97.416 






: 203.135 


‘2’min 


= 222.767 



Observing the Tables one can notice, first, that the results for the piece- wise linear function qi[y) 
defined by (20) are very close to those of Example 1 in Section 2 with the step-wise function qi[y). 
Second, it is clear that the bigger investment sum (El/*) leads to the lower total expense (zmin) 
including the expected fines for the ecological damage. 

4.1. A Special Case 

In practice, when applying the above-developed techniques to the real investment problems, one 
often has to process more general step-wise function qi{y) than that of problem (2)-(5). For instance, 
consider the emergency probability functions of the following form: 





Pi, 


if 


y < bio, 




9i(y) = - 


Qili 


if 


^to ^ 2/ ^ ^*1, 


(23) 




. Qi,ti , 


if 


VI 

VI 

7 





here 0 < 6*0 < • • • < ^ < %u < . . . < 9*i < Pi < 1 . 

It is readily verified that problem (10)-(13) with function (23) is equivalent to the following one: 



subject to 



n ti-l 

maxy~] dijXijj 
»=i j=0 

ti-l 

Xy ^1, i = 1, 2, . . . , 7T-; 

i=o 



(24) 



(25) 
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n U-l 

^ ( 26 ) 

t=i i=o 

Xi, € {0,1}, i = l,...,n; j = 0, 1, . . . - 1, (27) 

where = fi{pi - Qij+i) - > 0 for all ij. 

Problem (24)-(27) is clearly one of the generalized 0-1 knapsack problems, and it can be solved 
exactly by the same greedy algorithm mentioned in Section 2. Indeed, for all i, j, the variables are 
ranked by the similar “prime indexing” so that 

d(i,oy/^(i,oy > c?(i,i)'/^(i,iy ^ , 

where the pairs (1, 0)', (1, 1)', . . . constitute a permutation of the pairs 
(1, 0), (1, 1), ... , (1, - 1), (2, 0), . . . , (2, t 2 - 1), . . . , (n, tn - 1). 

Then one can solve the problem easily by taking X(i^o)' as large as possible without violating con- 
straints (25)-(27), then taking X(i^i)/ as large as possible subject to the value already assigned 
and so on. 
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Summary: The complexity of telecommunications is increasing rapidly in several areas and is 

manifested by rapid technology evolution and convergence, services and value chain convergence, 
regulation of the market, new business structures, and new forms of cooperation and competition. 
There are few, if any, examples of industries undergoing this much change during such a short 
time. This new study area requires the use of tools from operations research such as optimisation, 
game theory, scenarios and system dynamics. It requires also development of analytic skills on core 
competencies and other organisational aspects, use of real options for investment and risk analysis, 
and object orientation for the description of large systems and the management of their complexity. 

1. Introduction 

During the last five years the complexity of telecommunications has increased enormously, and much 
more than during the previous 20 years. A simple estimate indicates that the increase in telecom- 
munications software complexity has been several hundredfolds during these 25 years. The increase 
of complexity is seen in several areas: technology and technology convergence, services and prod- 
ucts, markets and C4 convergence, competition and market regulation, and business organisation 
and role decomposition. In the next chapter I will briefiy describe each of these complexities. The 
increased complexity is also seen as an increased demand for supervision and error reporting of net- 
works. Error detection and error reporting are becoming major cost drivers of the network [1]. An 
estimate indicates that 100 million error reports are generated daily in the international network. 
The increased complexity is coming at the same time as the network has become simpler in the sense 
that it contains fewer switches and provides more capacity per transmission system as before. It 
also contains more and bigger on-line support systems such as subscription databases and operations 
surveillance systems. In Norway the number of exchanges of the telephone network has been reduced 
to approximately one tenth - that is, to about 180 - since 1970 while the traffic has at least doubled. 
This is so because exchanges have become bigger in the backbone network and exchanges in the local 
network have been replaced by simpler remotely controlled units. The potential for decrease in the 
number of major network nodes is much bigger than this. The Norwegian land mobile GSM system 
offers approximately one third of the traffic of the telephone network. Nevertheless it contains less 
than ten exchanges serving all customers. The evolution has had different and conflicting impacts 
on dependability of the network. The increased complexity of each exchange and surveillance system 
reduces the dependability of the telecommunications system - or, equivalently, increases the cost in 
order to maintain the dependability. On the other hand, fewer nodes reduces the amount of equip- 
ment required and thus improves the dependability. Again this requires higher reliability per node 
since the advantage is counteracted by the fact that the fewer exchanges, the more people and the 
more of the country’s economy is affected by a single outage [1]. An outage of a few hours of an 
exchange in the finance centres of New York and Tokyo may cause world-wide economic disasters. 
The problem is that the dependability of the society is becoming at least directly proportional to the 
dependability of telecommunications as telecommunications is becoming more and more involved in 
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all society’s endeavours. 

2. Complexity 

2.1 Technology 

First, technology can be made more complex because integrated circuits reduces both size and cost 
and allows more functions to be implemented in a given space using software technology. In 1987 
the modulation and error protecting system of GSM - now contained in a small portable station - 
could only be simulated on computers as powerful as the Cray. Information technology is entering 
into telecommunications, and telecommunications is becoming part of the kernel of the platforms of 
information systems and is indistinguishable from it. In distributed applications this is very visible 
but difficult to implement because the two merging areas are traditionally so different with regard 
to design methodology, real time requirements and dependability. This merging of technologies 
is one manifestation of what is referred to as convergence of telecommunications and information 
technology. This is a problem which is easy to talk about but difficult to do anything about. It 
involves both industry and education. Second, technology is moving towards more bandwidth, more 
flexible solutions and integration of functionality. More bandwidth means two things. For the long 
distance transport networks the increased bandwidth, where 2.5 Gbit/s or even 10 Gbit/s will be the 
entities in which capacities may be bought and implemented, means that the network will probably 
have huge overcapacity, which in a competitive environment will become cheap. This is already seen 
on international connections. For local networks and access networks more bandwidth will imply 
that entirely new services and service combinations (for example, combination of multichannel TV 
distribution, interactive boadband services and narrowband applications) will be possible. Mobile 
systems offer more flexible solutions than fixed networks. It is (at least experimentally) possible to 
modulate bit rates up to 10 Gbit/s on radio connections. Moreover, it is possible to integrate several 
technologies in the same mobile station within reasonable cost. Integrated DCS 1800 and GSM900 
stations exist. Integrated land mobile and satellite mobile stations are coming. The distinction 
between fixed and mobile access thus disappears (fixed-mobile convergence). Recently, it has been 
reported that the Canadian operator Nortel is developing a method by which at least 1 Mbit/s can be 
transferred on electricity cables to the homes. This opens for integration of different technologies such 
as delivery of electricity and telecommunications on the same infrastructure. Broadband networks 
such as ATM and fibre rings may also solve the problem of offering one transmission system for all 
services, something the ISDN failed to do. Much of which is listed above may represent simplicity 
rather than complexity. However, this apparent simplicity in solutions and services is only possible 
because we are able to handle more and more complex technologies. 

2.2 Services 

Until the beginning of the 1990s, services comprised mainly simple two-party calls.. The main focus 
was the residential market. Since then we have got supplementary services such as call forwarding, 
call barring and call waiting and virtual private networks, and, most important, the web on Internet. 
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The web provides us with a completely new concept of what services are. The network computer, the 
intelligent agent and the Java language are derivatives of this technology. Services are just as much 
bundling and combination of offers as single products. One example is bundling of TV channels, 
local advertisements and subtitles over a satellite network involving several independent business 
systems. Electronic trade combined with payment services, security and anonymisation services, 
physical delivery of goods, delivery of telephony, delivery of electronic newspaper and advertisements 
is another example. Electricity, cable TV and telephony may also be combined as a single offer and 
on one bill. Services may be delivered by one company or be composed of services delivered by several 
specialised companies. Alliances between different actors may be built on single service offers or on 
portfolios of services and other products, resulting in interesting and difficult games of importance, 
dominance and buy-outs. Telecommunications services to the business market may be used by the 
companies purchasing them as raw material or as components for building their own products. In 
this way telecommunications may become a secondary or even ternary product. Electronic banking, 
electronic newspaper and electronic trade are examples where telecommunications services are just 
enablers for the primary service. The telecommunications product may enter the value producing 
system of the company as an element or a support activity in a value chain, as a layer in a value 
network or as a skill in a value shop [2]. The product is different in each of these cases because it 
will see different interfaces. The impact of this is poorly understood by the telecom operators and 
is probably critical to them. In this market it is difficult to analyse what the product really is and 
how it is viewed by the buyer, and how it will evolve. 

2.3 Market 

There are two markets of telecommunications as illustrated in Figure 1: the old market where two- 
party calls are offered to an end user market, and the new market where information is offered to 
a consumer market. The first market is in many countries stagnating because there is no more 
users to connect and no more time for talking. The second market has been here for a long time in 
terms of cable TV distribution and premium rate services but its interactive realisation came as a 
result of Internet. The market is rapidly growing. There is still a third type of market not shown 
in the figure. This market connects content providers and other businesses in order to serve as an 
enabler of interaction between interest groups. This market has many of the characteristics of the 
market representing the new chain in the figure. For the old market the operator requires one market 
facing organisation. The market is simple and easy to segment and operate in. The new market 
is extremely complex because the operator is facing two types of markets for the same product: a 
consumer market which coincides with the end user market of the old chain, and a market consisting 
of content providers and (as will become more evident in Section 2.5) other operators. One of the 
products sold to the content provider will be the size and the segmentation of the consumer market. 
The consumer market may therefore need different segmentation than the end user market. The 
segmentation may be indirect, that is, adapted to each content provider’s needs. This is a skill 
still to be developed by the operators: the segments of a diaper manufacturer and a car dealer are 
different. 
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The old value chain 
Stagnated market? 
«C1» 




The new value chain 
Growing market? 

C4 




Figure 1: Delivery chains 

In order to serve the new market the operator must understand, firstly, how telecommunications is 
used by the content provider as a component in the end product, and, secondly, how the consumers 
interact with the end product. The new chain may also change the balance of power in telecom- 
munications. The chain may be divided into four businesses, the C4 chain: content, computation, 
communication and consumer electronics, or as a set of equivalent activities: create, process, dis- 
tribute and display. The problem for the telecommunications operator, being only responsible for 
the distribution, is to loose the control of the chain. The control can equally well be held by the 
creator of content, a firm processing the information, and, for that matter, the provider of the display 
medium (for example Microsoft and its Teledisc system). For this reason, actors along the C4 chain 
are forming alliances or buying each other in order to retain or obtain dominance of the chain. In 
this way companies viewed earlier to be customers for telecommunication may become operators and 
competitors or vice versa. In this chain telecommunications may become a cost factor more than an 
asset of value for the content provider and other actors in the chain. 

2.4 Competition 

The problem with telecommunications is that it has until recently been an area of monopoly, or at 
best, consisted of cartels, except in niches such as data transmission. The niches have been small and 
of minor business value. This may change with Internet. The problem is to make competition work in 
an area with one previous monopoly operator having virtually 100% of the market share for telephony 
and infrastructure. This is done by letting the supervision of competition reside with a dedicated 
regulator and on terms decided by the politicians. This means that the new market consists of one 
dominating incumbent, competitors competing among themselves and with the incumbent in various 
businesses, and a regulator having no business interests in the market and implementing the rules set 
by politics. The problem with this arrangement is that it comes at a time when telecommunications 
undergo tremendous evolution with regard to technology, market and business structure. This may 
lead to poor understanding of issues like competition across technological boundaries such as between 
mobile and fixed services, between cable TV networks and telephony, between Internet and telephony. 
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etc. Such competition may be partly or wholly supplementary, complementary or substitutional, 
and may change over time. Figure 2 shows an example where telephony and Internet are compared 
using the three parameters technical quality (T), functionality as perceived by the user (F), and price 
(P) [3]. Even though both networks may offer telephony, Internet is the clear winner on functionality, 
and this is likely to be the key competitive parameter. Hence, telephony cannot meet competition 
from Internet by reducing the price. 




Figure 2: Competitive parameters 

This was just one simple example. As the market moves towards technological convergence and C4 
convergence, there will be a number of such examples, some of which may be poorly understood and 
involve undesired possibilities for arbitrage. One such danger lies in the problem that most of the cost 
for telecommunications is related to access, perhaps as much as 80%. This means that competition 
is difficult for services such as telephony since there is little possibility to differentiate the product. 
There the main competitive factor is price and it becomes important to control the major cost factor. 
This cost may also limit the business opportunities for operators renting their access from a strong 
incumbent operator. Access is thus a major concern for the regulator. 

2.5 Business organisation 

The telecommunications business may be divided into several roles, where each role may represent 
a separate business. A company may consist of one or several roles. A model we have developed 
contains ten roles interconnected with one another as shown in Figure 3. The figure contains only 
some of the possible interactions. The composition of the model is based on the observation that for 
each role there are in fact companies doing separate businesses there. The roles are as follows: 

• Infrastructure owners own buildings, ducts, masts, etc., required for cables and equipment. 
Most existing telecom operators are in this role but it also contains other parties such as road 
authorities, cities (e g, the City of Stockholm) and electricity companies. 

• Transport providers own cables, fibres, frequency licenses, satellites, etc. 

• Access providers own copper access lines, fibres, concentrators, frequency licenses, base stations, 
satellites, satellite earth stations, cable TV networks, etc. 

• Platform operators own switches, servers, processors and “intelligent” nodes, etc., for support- 
ing processing, storing, switching, interconnection and routing. 
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• Content hosts offer storage and processing to content providers. 

• Subscription managers keep customer relationships, offer customer care and interface the mar- 
ket. 

• Retailers sell subscriptions, traffic and customer care to the residential market. 

• Clearinghouses offer transaction support, security, information filtering, trust and insurance 
against fraud and liabilities. 

• Content providers create and bundle information such as newspapers. 

• System integrators design solutions for customers and bundle technologies and contents from 
other actors of the model. 




Figure 3: Business system 

Roles may interact vertically or horizontally. Vertically means that one role purchases from another. 
Horizontally means that to similar roles compete. This is illustrated in Figure 4. 

Vertical relationships exist between role 11 (platform operator) and roles 21 and 22 (access providers). 
The external relationship may be regulated. Horizontal competitive relationships (possibly regulated) 
exist between roles 11 and 12 (platform operators) and roles 21 and 22 (access operators). 

3. Operations analysis 

Now we shall look at some strategic methods which may increase the understanding of the system, 
the competition and the impact of regulation. This survey is not complete. However, the literature 
in the field is increasing where the below reference is rather comprehensive though not dedicated to 
telecommunications [4]. 












405 




Figure 4: Relationships 



3.1 Optimisation 

New systems are likely to be designed on distributed platforms supporting software portability and 
vendor independence. This implies that software may be redistributed during processing in order 
to optimise performance and distribute load equally on nodes. Since processing requirements, avail- 
able capacity at nodes and other parameters will be stochastic variables, stochastic optimisation 
and stochastic simulation rather than traditional teletraffic theory may be pertinent methods for 
analysing the performance [5]. The methods are well known but they have, as far as we have seen, 
never before been applied to performance analysis of telecommunications. Reliability evaluation rep- 
resents a similar problem because different distribution methods may have different impact on the 
system’s overall dependability. This may also lead to stochastic graphs for analysing routing of traf- 
fic and addressing of nodes. Other optimisation methods which do not take into account stochastic 
rearrangement and load balancing, may be based on graph theory or linear programming [6, 7]. In 
certain radio systems (military and satellite) even the connectivity of the graph, that is the existence 
of branches, may change over time and thus require stochastic graph theory in order to be analysed. 
The organisational model itself is also becoming so complex and big, with several competing and 
cooperating instances of roles, that stochastic methods are required in order to understand it. Very 
little, if anything, has been done in this area. 

3.2 Games 

Figure 5 shows some of the games in telecommunications. The model contains only a few roles. 
The complexity increases tremendously if we include all roles defined above and take into account 
that there will be a multiplicity of each role. However, the figure shows some of the individual 
games between roles: competition, regulated interconnect, commercial, alliance and organisation. 
The games in each category may be different. In addition the figure shows several other games: 
between regulator and operator roles regulating the competitive arena, between society and the 
telecom providers indicating that services are utilities for the society and thus require rules for 
deployment, availability, etc. [8], between the providers and the market. An important point is that a 
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telecommunications service may be a primary product (mass market telephony), a secondary product 
enabling another product in the market (electronic mail where the primary product is the mail), or 
a ternary product being a production factor of another product (computer-telephone integration). 
These market interactions are poorly understood because some of them are new and coming as a 
result of Internet web and derivatives. 




Figure 5: Relationships and games 



3.3 Scenarios 

This is a tool that has been used in several cases mainly because it is simple and does not require 
the mathematical skills of other areas of operations research. The method identifies how the large 
number, often several hundreds, of external variables changes the future. Example of such variables 
in telecommunications are: price regulation, market for video on demand, attractiveness of film on 
CDs, income per household, time allocated to different activities, fall in oil prices, and earthquake in 
Tokyo or Los Angeles. The key point is that the external variables can be aggregated into a few global 
parameters. I have during the last few years made several scenarios for the different business areas 
of Telenor and each time I have ended up with two - and in one case three - aggregated stochastic 
variables, for example market growth and intensity of regulation, or type of market (ranging from 
traditional to Internet) and value chain binding. Building of four to six scenarios in each case 
will then provide management with information for decision making which is better than belief in 
leading stars and the continuity of history. The procedure has several pitfalls resulting in wrong 
decisions. Therefore it must be done by people experienced not only in scenario techniques but also 
in telecommunications. Scenarios have one enormous advantage normally ignored, and that is that 
the strategist is forced first of all to find parameters infiuencing the future but most important to 
look for dependencies which are not obvious or direct. One simple example is that oil price has no 
direct impact on the performance of Telenor but the indirect impact may be tremendous if the oil 
price falls so much that it leads to economic depression in Norway. A less obvious impact comes 
from the tendency of people to own rather than to lease. This may cause that the market for video 
on demand may not take off and become economic viable because people will buy films on CD or 
tape rather then getting them on the network. Scenarios are also important as support for other 
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methods such as system dynamics, identification of core competencies and analysing risk factors of 
investments. 

3.4 System dynamics 

System dynamics provides cybernetic models of the company or part of its business. System dynamics 
can thus de used to describe the impact of such things as cooperation, competition and regulation 
on the business performance, and is thus an analytic parallel to scenarios. The models are normally 
huge and complex but surprisingly easy to make. One highly aggregated model we made for Telenor 
Network contained about 2000 variables and was built in about six months. Figure 6 shows a simple 
model, in order to visualise the principle, of competition between two operators with regard to their 
market size and their access to skilled people. The method has also been used to evaluate the impact 
of core competency on market performance [9]. One challenge is then to find analytic expressions 
for and to parametrise the competencies. Thus system dynamics is becoming part of organisation 
theory. System dynamics is also closely related to games and may either be made as a game itself 
where several players may compete in the market, as an analytic part of a game, or for simulating 
certain elements of a game. System dynamics is also difficult to use and it is similarly difficult to 
make management trust in it. The reason is often that the model predicts a different future than 
that defined in business models and budgets. However, one important result of the modelling is that 
the model requires values for each variable. In some cases these values are critical for evaluating 
the company and it often turns out that the value is not known or not measured by the company 
or difficult to obtain. These are important feedbacks in order to improve the organisation since it 
makes it possible to identifv which parameters are important for decision making. 




Figure 6: Example of system dynamic model 



3.5 Core competencies evaluation 

A company in telecommunications can be viewed as shown in Figure 7. The current value of the 
company may then be assessed in terms of the product portfolio and the technologies that make it. 
The future value will be determined by the core competencies that the company possesses and the 
technologies it masters [10]. A core competency is a skill that is difficult to copy and difficult to 
manage which the company may possess alone for a long time, and which both shapes the future of 
the industry and differentiates the company from its competitors. 

In this view, it becomes critical to determine the core competencies of the company, and, as has 
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Current value Future value 




Figure 7: Competencies model 



become evident from studies of Telenor, the mastery of the technologies. The mastery of the tech- 
nologies may, of course, be core competencies. However, in an area so heavy in technology as 
telecommunications it is advantageous to treat the mastery of technology separately, in particular 
support technologies as sales and customer care support systems, data warehousing, operation cen- 
tres, etc., which may be components of core competencies or the prerequisite for the existence of a 
core competency. Competencies may be analysed by use of the methods described above in order to 
find which of them belongs to the core. Game analysis, system dynamics and scenarios will require 
insight into the core competencies in order be good models of the company and give correct and 
relevant answers. 



3.6 Investments 

Investments pose several problems in telecommunications. Firstly, competition replaces comple- 
mentarity and supplementarity of services, for example between fixed and land mobile networks. 
Secondly, cannibalisation of products or competition within a company is increasing. This is partly 
because of the above and partly because new services can be implemented on diflferent networks 
providing different characteristics and price, for example narrowband Internet over telephone access 
lines and asymmetric broadband Internet over satellite systems. Thirdly, the technology and the 
market is changing so quickly that an investment is running more and more the risk of being sunk 
cost with no or little return. Investment in video on demand using ADSL may be such a case if 
the video on demand market does not materialise because of other distribution channels for film. 
Finally, historical data for predicting the future are becoming useless because the area is changing 
so fundamentally because of new capabilities or the service is so new that there is no history. E-mail 
on Internet replacing facsimile on the telephone network serves as example since this will result in an 
increase in Internet traffic and a reduction in telephone traflBc but, in order to complicate the issue 
further, the increased Internet traffic will still be present on telephone accesses but at different price. 
Several of the methods for strategic analysis described above will provide valuable input in order to 
shed light on some of these problems. In addition it may be useful, though difficult, to treat the 
investments as real options in order to take into account uncertainty and the value of not investing 
now but wait [11]. Financial risk analysis is part of this. 
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3.7 Object orientation 

Object orientation is a software tool which may be used to describe roles and interactions and reduce 
complexity. An object consists of two things: a public part which is visible to everyone interacting 
with the object, and a private part which consists of the internal structure of the object. The private 
part of the object is hidden from view of other parties. The public part is defined in terms of 
attributes and operations other objects can do on the object and its attributes. In order to avoid 
becoming too abstract I will give a simple example. A platform operator and a transport provider 
(see Figure 3 above) are represented as two interacting objects in Figure 8. The interaction takes 
place via a connect relationship (horizontal line between the two objects). The transport provider 
offers a public view of itself in terms of the attributes offered bandwidth, price list, available capacity, 
and quality of service. 



Platform operator 


Connect 


Transport provider 


Connection technology 


Offered bandwidth 
Price list 

Available capacity 
Quality of service 





Figure 8: Object orientation 

A real model is much more comprehensive and complex than the example may suggest. It may 
be based on the methods called Open Distributed Processing suggested by the International Stan- 
dardisation Organisation (ISO) for systems supporting distributed software platforms. This model 
also contains methods by which companies owning and operating such platforms may be designed. 
What is important, however, is that in an object oriented model we view the companies as simply 
as possible, that is, only the attributes which are strictly necessary are shown in the model. The 
model should concentrate on the roles and not on the actual companies because companies may be 
designed differently even though they do the same business. In this way we may make a model of all 
companies in the same way and uncover similarities and differences in a new way. 

4. Conclusions 

In this paper some of the new complexities of telecommunications have been described. I will again 
stress that this complexity is of new origin and is mainly the result of the development since 1990 
though several of the technologies were developed earlier. The convergence of telecommunications 
and information technology has been one driver for this development. Another driver has been the 
deregulation of the business although this deregulation has been in areas where little evolution is 
expected (telephony) and in one area where it is extremely difficult to assess what will happen (basic 
transport systems). The deregulation has made it easier for new entrants in the market to commence 
operation even though their business could have been established using different technologies. The 
critical problems are, first, that the change is so rapid and, second, that it is so fundamental with 
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regard to technology, products and markets. This require that the whole arsenal of strategic methods 
are required in order to cope with the different aspects of the difficulties. What will be found is that 
several of the methods need adaptation to the business and the new complexity. One example 
is that telecommunications enters other products in a way which is far from being understood. 
Telecommunications is therefore one of the biggest challenges the operations research community 
has ever had. The survey of methods and their applications given above is just a first scratch of the 
surface. 
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Summary: A commonly used model for routing in communication networks is the linear multicom- 
modity network flow (MCNF) model, in which a commodity is deflned by a communication pair, also 
known as an 0-D pair. In the basic MCNF model, there is no restriction on which paths that can 
be used for routing. In telecommunication networks, there are often restrictions on which paths that 
are permitted to use for each 0-D pair, due to time delay and/or reliability requirements. We extend 
the basic MCNF model by including side constraints on paths. Two extended models with different 
objectives are studied. In the first model, the cost function is linear in the flow variables. This 
model can be efficiently solved by a tailored column generation approach. In the second model, the 
routing consideration is based on the longest path used for each 0-D pair and leads to an objective 
function of min-max type. We clarify the relations between these two models and discuss bounding 
procedures for the second model. Some computational results are presented. 
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1. Introduction 

A commonly used model for routing in communication networks is the linear multicommodity network 
flow (MCNF) model, in which a commodity is deflned as a communication pair, also known as an 
0-D pair. Given a set of 0-D pairs and an underlying network consisting of a number of nodes 
and capacitated links, the basic MCNF model concerns finding an optimal routing plan to send 
the demand of all 0-D pairs through the network at minimum cost. The MCNF model has been 
extensively studied, see for example [1] and [8]. 

In the basic MCNF model, there is no restriction on which paths that can be used for routing. In 
communication networks, it is quite common that for each communication pair, certain paths are 
more preferred than others, due to time delay and/or reliability requirements. Below we give several 
examples. 

The transmission time delay of an established path in a communication network depends on the 
links and nodes that this path contains. Generally, a path with many links/nodes results in a larger 
delay. It is desirable that this time delay is less than a certain limit for all paths that are used 
for communication. Furthermore, different communication pairs may have different limits, reflecting 
various priorities classes. The time delay requirements appear as side constraints on paths. A special 
case is when the cost is considered as time delay. 

Another consideration in communication networks is distortion. Distortion also depends on the paths 
that are chosen for routing. When there is a maximal allowed distortion level, it should be taken 
into consideration while assigning paths to 0-D pairs. 

A further important issue is reliability, which can be measured in various ways. One of them is the 
link failure rate. Reliable transmission is a basic requirement in providing high quality of service. 
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Assuming that for each link there is a certain possibility that it fails, the failure rate of a certain path 
can be calculated. For each communication pair, it is desirable that all paths that are being used 
have a failure rate not exceeding a specified limit, which can again be formulated by introducing side 
constraints on paths. Another important parameter connected to the quality of service is the bit 
error rate, which depends on the quality of the links as well as the path length. Generally speaking, a 
path that consists of many links has a higher error rate and causes more end-to-end retransmissions, 
which in turn lead to extra time delay and network load. So only those paths that are sufficiently 
reliable should be used for communication pairs which demand a high degree of quality of service. 
To describe such side constraints, we associate each link and 0-D pair with a generalized length 
value. The length of a path equals the sum of the link lengths along the path. For each 0-D pair, 
a side constraint states that all paths used for routing have a length less or equal to a certain limit, 
which depends on the priority class of this 0-D pair. 



2. Two Extended MCNF Models 



We study two extended MCNF models that incorporate such side constraints. In the first model, 
MCNF-Sl, the cost function is linear in the flow variables, as in the basic linear MCNF model. We 
use the following notation to define MCNF-Sl. The network is described by a set of nodes M and a 
set of directed links A. Link a has capacity Ua. Let C be the set of 0-D pairs. For each 0-D pair 
(p, q) dpq is its demand and TZpq is the index set of all elementary paths that start at p and end 
at q. The path length limit for (p, q) is Spq. The routing cost for sending one unit of flow of 0-D pair 
(p, q) on link a is Cp^a, the routing costs are assumed to be nonnegative. For 0-D pair (p, q) and link 
a, Wpqa is the length value, which we assume to be nonnegative. 

We further introduce a parameter dpqra, defined as below. 

{ 1 if path r € 1tpq uses link a, 

0 otherwise. 

Hence the unit routing cost of path r G TZpq equals Cpqr = Y^aeA^pgraCpqa- 
The following variables are used in the mathematical formulation. 
hpqr = flow on path r G Itpq. 

{ 1 if path r G IZpq is used to send any flow, 

0 otherwise. 

Given these definitions, MCNF-Sl is defined below. 

[MCNF - SI] z*si = n 
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pqeC reUpq 
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(1.6) 



Constraint set (1.1) contains the demand constraints. Constraints (1.2) set the link capacity bounds. 
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The forcing constraints (1.3) force ypqr to be one if path r G 'Rpq is used. Variable ?/pgr is then used 
in the side constraints (1.4) to guarantee that only those paths with a length less or equal to Spq are 
eligible. Coefficients dpq and Ua are assumed to be positive. 

It can be realized that MCNF-Sl is NP-complete, since if the capacity constraints are ignored, then 
the obvious optimal solution is to send whole dpq on the shortest feasible path of 0-D pair (p, q ) . 
This is a constraint shortest path (CSP) problem, which is NP-complete ([3]). 

The main difficulty of MCNF-Sl is to ensure that all paths used satisfy the length limits. We note that 
if we only treat the set of feasible paths for each 0-D pair and let Hpq = {r|r G 'JZpq, ^pqra^pqa < 
Spg}, we can eliminate all p- variables in the model. MCNF-Sl can then be solved as a linear program 
based on 1Zpq. A detailed study of MCNF-Sl and a column generation solution method are given in 
[ 6 ], 

Since flow of a commodity may be split on several paths in a solution, the path with the largest 
length (“the longest path”) is the most crucial one. When the length values concern transmission 
time, this is the path with the largest time delay. In some cases, we are interested in minimizing 
the length of this path for all 0-D pairs. This gives rise to our second model, MCNF-S2, with an 
objective function of min-max type. 

[MCNF - S2] Z*S2 = min Y^pq^c^^iT^a^A^pqra'^pqaVpqrV ^ SUbject tO (1.1) tO (1.6). 

We can also formulate MCNF-S2 by using variable Ipq to represent the length of the longest path 
used by (p, g). Constraint set (1.4) is replaced by the following two constraints sets to deflne Ipq. 

^pqra'^pqaVpqr ^ ^pq ^ '^pqi'^iPf O) ^ ^ and 
^pq ^ Spqi'^iPi Q) ^ 

Then the objective function of MCNF-S2 can be simply written as Zg 2 = minX;(p,g)ec hq- 

Due to its objective function, MCNF-S2 is apparently more difficult than MCNF-Sl. [5] contains a 

study of MCNF-S2 through a modeling point of view. 

3. Relations between MCNF-Sl and MCNF-S2 

Clearly these two models are very closely related. First of all, we may observe that MCNF-Sl and 
MCNF-S2 have exactly the same feasible set of solutions, the difference between them lies solely in 
the objective function. 

Next we show that by modifying the objective function of MCNF-S2 in a proper way, we can use 
MCNF-Sl to produce a valid lower bound to MCNF-S2. The idea is to replace the MCNF-S2 
objective by a linear function in path flow variables. We observe first that T^aeA^pqraWpqa is the 
length of path r and Ipq is the length of the longest path that is used. Furthermore, in a feasible 
solution to MCNF-S2 we have ^pqr/dpq = 1 and hpqr > 0. Hence for each {p,q), we can treat 

hpqr/dpq on different r G Hpq as weights in a convex combination. If we then take the sum of path 
lengths, weighted by hpqr/ dpq^ we get Vpq = Ere^pq ^pqr/ ^pq Ea6>t ^pqra^pqa- Variable Vpq is a weighted 
average path length of (p,q). For any feasible solution of MCNF-S2, it is obvious that ^pg — ^pq • 
This fact implies that if we minimize the sum of Vpq over all 0-D pairs instead of Ipq in MCNF-S2, 
the optimal objective value gives a valid lower bound of Zg 2 - Let MCNF-V denote the problem with 
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this new objective and z'^2 corresponding lower bound. It can be further realized that since 

^52 ~ ~ ^pgeC Y^reUpq f^pqr/dpq JlaeA ^pqra^pqa = 

pqeC 

min ^pq^c 12 reTlpqi^aeA^pqra'^pqa/dpq)hpqri 

MCNF-V is an instance of model MCNF-Sl with routing cost Cpqr = Y^aeA^pqra^pqa! dpq. Hence by 
solving model MCNF-Sl with these values as routing costs in the objective function, we obtain a 
lower bound to MCNF-S 2 . If Wpqa are integral in MCNF-S 2 , this bound can be strengthened to [2:52! • 
Note that hpqr/dpq is not the only weight setting that can be used to provide a lower bound. In fact, 
a valid lower bound is obtained be taking any convex combination of path lengths. The difficulty is 
that we often do not have knowledge of the whole Hpq explicitly. 

Feasible solutions to MCNF-S2 can also be generated via MCNF-Sl. If we fix the variables Ipq to 
Ipq with Ipq < Spq, then Ipq becomes the actual length limit for {p,q). Considering a solution that is 
feasible with Ipq as the length limits, obviously the sum of the longest path lengths in this solution 
will not be worse than Ylpqec which is a valid upper bound. The existence of a feasible solution 
can be examined by solving an instance of MCNF-Sl with Spq = Ipq and arbitrary routing costs. 
Later in this paper, we utilize this observation to construct a primal heuristic for MCNF-S2. 

4. A Lagrangean Relaxation of MCNF-S2 

Other relaxations can be applied to MCNF-S 2 in order to generate lower bounds. Since MCNF-S2 is 
a mixed-integer linear model, an LP-relaxation can be used. A difficulty is that the binary variables 
ypqr are in practice not known a priori. Another weakness of the LP-relaxation is that the bound it 
provides can be very weak; such an example is given in [ 5 ]. Alternatively, a Lagrangean relaxation 
can be used, and it appears that the most appropriate Lagrangean relaxation of MCNF-S2 is to relax 
the capacities constraints. 

Moving the capacity constraints to the objective function by using tTo as multipliers, we obtain the 
dual subproblem DS-S 2 below. 

[DS — S2] = min ^ Ipq -f- ^ ^ ^pqra'^a)hpqr 

pqeC pq^C reTlpq o€>4 
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0 




eC 


Vpqr 


e 


{0,1} 


Vr6?lp„V(p,g) 


eC 



Solving DS-S2 with any fixed tt > 0 yields a lower bound to MCNF-S2. Let LD-S2 be the corre- 
sponding Lagrangean dual problem and Zg2 the optimal value of the LD-S2. Note that DS-S2 does 
not have the integrality property, hence LD-S 2 may provide a better bound than the LP-relaxation. 
What is then the relation between Zg2 and Zg2^ Below we show that these two bounds are actually 
equal: 2^2 = ^S2- 

To show this result, we first prove that DS-S 2 has at least one optimal solution in which the whole 
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demand dpq is sent along one single path for each (p, ^). 

Assume that demand dpq is split on m different paths in a feasible solution. Let be the 

lengths of these paths and Ai, A 2 , be the proportions of demand that are sent on these paths. 
We have Aj > 0 and Aj = 1. Let Cpqr = 'Z^aeA^pgra'^a- The value of the objective function 
becomes 

max(/i, /2, ..., Im) Cpqfi(Aidpg) “h Cpq2{X2dpq) “1“ ... + ^qmi^m^pq)’ 

If we restrict the demand to be sent on one of these paths, which is feasible in DS-S2 since capacity 
constraints are relaxed, and choose the path that minimizes the objective function value, we get a 
new feasible solution with an objective value equals to 

min(/i + ^qldpq, I 2 ^q2^pqt •••» d" ^qm^pq) 

< Ai(/i -f ^qldpq) + \2(l2 + ^q2dpq) + ... + Am(^m d* Cpqm^pq) 

— Ai/i + A 2/2 d" ... d" ^rrJ"m d" ^\^q\dpq d~ ^2^q2^pq d" ... ~h ^m^qm^pq 

< max(/i, /2i •••? ^m) d" ^qli^ldpq) + ^q2{^2dpq) d" ... d" ^qmi^m^pq) • 

Hence by sending the whole demand on a single path that minimizes the objective value, we get a 
solution that is at least equally good, which completes the proof. 

Since choosing path i yields an objective function value that equals li + Cpqidpq = ^pgia'^pga d- 
TfaeA ^pgia'^adpq, the Optimal path of DS-S2 for (p, q) can be obtained by finding the shortest feasible 
path using Wpqa + T^adpq as the link costs, which means that DS-S2 is actually a number of CSP 
problems. 

We summarize this result below. 

Proposition 1 For any tt > 0, DS-S2 has at least one optimal solution in which dpq is carried by 
one single path for each (p, ^). 

We next show that Zs 2 = Zg 2 - Let 'Rpq be defined as in Section 2. As discussed above, solving DS-S2 
for (p,q) is equivalent to finding f = argmin{r G Hpq\T,a€A^pgra'iJiJpqa d- HaeA^pgraT^adpq}, and f is 
obtained by finding the shortest feasible path using Cpqa = Wpqa d- T^adpq as link costs. For (p, g), its 
contribution to the objective of DS-S2 is then Sa€>i^P 9 fa(^ 9 o)- 

To obtain z^g 2 ^ we use z ^2 = HpqecT,reiipg(T,aeA^pgraWpqa/dpq)hpqr to replace the objective 
function in MCNF-S2 as in Section 3 and get MCNF-V, which is an instance of MCNF-Sl. The 
p- variables in MCNF-V can be eliminated by considering only Rpq for each (p, p), and MCNF-V can 
be solved exactly by a column generation, or equivalently, a dual decomposition algorithm, in which 
the capacity constraints (1.2) are dualized. We use DS-V to denote the dual subproblem obtained 
by dualizing the capacity constraints by multipliers tt^. The objective function of DS-V is 

min ^pq^c ^aeAi^pgra'^pqa/ dpq)hpqr d” ^pq^C ^re'^q{^aeA^pgra'^a)hpqr 

— ^pqeC Sr€'^P9 (Sae>4 ^Pgrai'^pqal^pq “h '^a))hpqr 

— ^pqeC Y!,re'^qi^aeA^pgra^pga)hpqrj 

where Cpqa — Wpqa/dpq + TTfl. 

Clearly DS-V also separates in 0-D pairs. For 0-D pair (p,p), the optimal solution is the shortest 
feasible path using Cpqa as the link costs, and the whole demand dpq will be sent on this path. 

Now observe that Cpqa = ^qafdpq. Therefore for the same tt, the optimal path f for each (p,q) 

is the same in DS-S2 and DS-V. Moreover, the contribution of (p, q) to the objective of DS-V is 
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YlaeA^pqfaCpqadpq = Y,aeA^pqfa{^qa) • Heiice Under the same tt, both DS-S2 and DS-V provides 
exactly the same bound. It implies that MCNF-V and LD-S2 have the same optimal value, as stated 
below. 

Proposition 2 The best lower bound provided by LD-S2 equals the lower bound provided by MCNF- 
V, i.e. 2^2 = zfj. 

5. Solving MCNF-Sl by Column Generation 

As pointed out before, if we only consider paths that are feasible with respect to the side constraints, 
MCNF'Sl can be treated as a linear program. Column generation have been applied to the basic 
MCNF model in for example [2] and [7]. The difficulty of applying column generation to MCNF-Sl 
lies on the generation of feasible paths. To solve MCNF-Sl, we propose an exact column generation 
approach. In the path generation subproblem, we consider only paths that do not violate the side 
constraints. This requires solving a CSP problem for each 0-D pair. Although CSP is theoretically 
difficult, it can be solved quite efficiently in practice by specialized algorithms. For solving these 
subproblems, we propose a multilabeling algorithm, which has been shown to be very efficient. For 
a detailed description of the approach, see [6]. 

6. Bounding Procedures for MCNF-S2 

MCNF-S2 cannot be solved directly in the same way as MCNF-Sl. For MCNF-S2, we concentrate 
on bounding procedures. By solving MCNF-V (which is an instance of MCNF-Sl) using column 
generation, we obtain a valid lower bound. Since the optimal flow solution to MCNF-V is also 
feasible in MCNF-S2, we can also compute an upper bound by simply observing the lengths of the 
longest path used for all 0-D pairs. To further improve this upper bound, we use a primal heuristic. 
The idea of the primal heuristic is to solve a sequence of MCNF-Sl instances in order to obtain 
improved feasible solutions to MCNF-S2. The heuristic flrst solves MCNF-V. Assume that for (p, O'), 
the longest path used is of length Ipq. Then the upper bound given by MCNF-V is hq- MCNF- 
V is clearly feasible if we set Spq = Ipq for all 0-D pairs. Next, the primal heuristic attempts to reduce 
Spq and checks whether the problem is still feasible. If the modifled problem is still feasible, then 
the corresponding upper bound is not worse than J2pqec ^pq- A crucial issue is the rule for choosing 
0-D pairs. Let rpq be the shortest path length for (p,q) with Wpqa as link costs, i.e. rpg = min 
{T,aeA^pqraWpqa\r G ^p^}. Then Spq should never be less than Ppq. Let Qpg = Spg - rpg, gpg gives an 
indication on how much we may reduce Spg. The primal heuristic uses Qpg to choose 0-D pairs and 
works as follows. The heuristic chooses a step length which determines the reduction size of Spg, it 
then sorts all 0-D pairs for which gpg is no less than the step size and maintains these 0-D pairs in a 
list. Thereafter it scans through the list and reduces Spg one at a time for each 0~D pair in the list. 
If the problem is feasible after a reduction, then the reduction is actually performed and an upper 
bound is computed, otherwise it sets Spg back to it original value and repeats the same procedure for 
the next 0-D pair. The step size is reduced if no improvement is made when the list is completely 
scanned. A formal description of the heuristic is given below. We let Q be a list of 0-D pairs, s be 
the current step size, Ipg be the length of the longest path used by (p, q) in the current solution, and 
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z be the best upper found by the heuristic. 

1. Initialization: Solve MCNF-V and obtain Ipg. Set z = IZpqechg' 

2. Initial step size: Compute Ppq = Spq — Tp^, V(p,g^) e C. Set s = a max {ppg 1 V(p, € C}. 

3. List construction: For all (p, q) with Qpq > s, insert (p, q) into Q. Sort Q in a descending order 

using Qpq. 

4. Feasibility check: Scan through Q, For each (p, q) £ Q, set Spq = Spq — s, solve the modified 
problem. If it is feasible, compute the new values of Ipq, and if Y^pq^c^g < ^5 update z. 
Otherwise, set Spq = Spq + s. 

5. Step reduction: If z is improved in step 4, go to 3. Otherwise set s = ps. If s < 1, stop. 
Otherwise go to 2. 

In our implementation, we have used a = 0.9 and p = 0.7. 

To check feasibility in each iteration, the heuristic solves a phase-I problem of type MCNF-Sl by 
column generation. 

7. Computational Results 

We present computational results on four test problem groups with different network structures. 
There are eight problems in each of the first three groups and the last group consists of six problems. 
The networks in the first group are complete graphs. Almost all node pairs are 0-D pairs. All 0-D 
pairs have the same demand. In group two, the networks are randomly generated. The third group 
are grid networks. Compared to randomly generated networks, the grid networks have many paths 
between each pair of nodes. The six problems in the last group come from a special MCNF test data 
set, called the Canad problems. The original data set is constructed for discrete capacitated network 
design problems with fixed charges on links ([4]). In our tests, the fixed charges are ignored, as well 
as the partial capacities of 0-D pairs. To make them more computationally challenging, the mutual 
capacities are reduced to about twenty percent of the original values. For all test problems, Wpqa are 
randomly generated integral values and vary by links. For each link, Wpqa are however the same for 
all 0-D pairs. 

In both MCNF-Sl and MCNF-S2, 0-D pairs belong to several priority classes. For all test problems, 
we have used three priority classes and hence three different values of Spq in the problem data. All 
computational tests have been conducted on a Sun UltralO with a 300 MHz CPU. 

We first present computational results of solving MCNF-Sl by column generation in Table 1. 

Table 1 contains the problem size of each test problem and the solution time of the column generation 
method in seconds. The computational results show that the column generation method appears to 
be a promising solution approach for MCNF-Sl. The solution time indicate that it efficiently solves 
all test problems to optimality, even for fairly large instances, such as grid? and gridS. For a more 
detailed presentation and discussion of these results, together with some comparisons to results of 
solving of MCNF-Sl using other codes, see [6]. 
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Complete networks 





coml 


com2 


com3 


com4 


com5 


com6 


com7 


com8 


1^1 


15 


20 


10 


15 


20 


25 


22 


25 


1^1 


210 


380 


90 


210 


380 


600 


462 


600 


|C| 


182 


342 


90 


210 


380 


600 


462 


600 


Time 


0.10 


0.37 


0.05 


0.14 


0.50 


1.33 


0.25 


1.21 






Randomly generated networks 








rani 


ran2 


ran3 


ran4 


ran5 


ran6 


ran7 


ran8 


|V| 


20 


30 


40 


40 


70 


80 


90 


100 


Ml 


380 


500 


1000 


1000 


1000 


1000 


1000 


1000 


|C| 


380 


600 


78 


600 


138 


158 


178 


198 


Time 


0.30 


0.97 


0.52 


2.63 


0.66 


0.68 


0.67 


0.68 








Grid networks 










gridl 


grid2 


grid3 


grid4 


grid5 


grid6 


grid7 


grid8 




25 


100 


100 


225 


225 


400 


400 


625 


Ml 


160 


360 


360 


840 


840 


1520 


1520 


2400 


|C| 


100 


150 


300 


200 


400 


250 


500 


500 


Time 


0.05 


0.42 


1.34 


1.99 


13.62 


5.08 


34.93 


73.26 






Canad networks 












canl 


can2 


can3 


can4 


can5 


can6 






|V| 


20 


20 


20 


30 


30 


30 






Ml 


290 


229 


288 


678 


517 


686 






|C| 


40 


200 


200 


100 


400 


400 






Time 


0.04 


0.12 


0.29 


0.45 


1.45 


3.64 







Table 1: Computational results of MCNF-Sl. 



Computational results of MCNF-S2 are displayed in Table 2. 

In Table 2, \zg 2 \ is the lower bound provided by solving MCNF-V, z is the upper bound given by 
MCNF-V and G is the gap between these two bounds. The best upper bound found by the primal 
heuristic is z and G is the gap between z and \zg 2 \ • Table 2 also shows the total number of feasibility 
checks performed by the heuristic and the solution time in seconds. 

We observe that \zg<^ and z are quite strong for the first three problem groups. Hence by solving 
MCNF-V, we obtain good solutions of MCNF-S2, in some cases the solution is actually optimal. 
The primal heuristic often succeeded in eliminating more than 50% of G. For the last group, G is 
considerably larger, and less improvements are made by the primal heuristic. It is however, unclear 
yet whether the final gap G depends mostly on the lower bound given by MCNF-V or the primal 
heuristic. We also note that the primal heuristic becomes very time consuming for some problems, 
mainly due to their problem sizes. 
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Complete networks 





coml 


com2 


com3 


com4 


com5 


com6 


com7 


com8 


[^52! 


660 


1147 


355 


651 


1201 


1673 


1383 


1803 


z 


660 


1147 


358 


653 


1205 


1674 


1396 


1838 


G{%) 


0.00 


0.00 


0.85 


0.31 


0.33 


0.06 


0.94 


1.94 


z 


- 


- 


356 


651 


1202 


1674 


1388 


1816 


G{%) 


- 


- 


0.28 


0.00 


0.08 


0.06 


0.36 


0.72 


#MCNF-S1 


1 


1 


13 


13 


25 


9 


61 


149 


Time 


0.11 


0.34 


0.47 


2.07 


12.30 


10.61 


49.10 


261.83 








Randomly generated networks 








rani 


ran2 


ran3 


ran4 


ran5 


ran6 


ran7 


ran8 


[^52! 


1290 


2462 


408 


2090 


824 


1230 


1238 


1420 


z 


1316 


2500 


468 


2120 


849 


1250 


1246 


1423 


G{%) 


2.02 


1.54 


14.71 


1.44 


3.03 


1.63 


0.65 


0.21 


z 


1298 


2481 


429 


2102 


832 


1239 


1243 


1422 


G{%) 


0.62 


0.77 


5.15 


0.57 


0.97 


0.73 


0.40 


0.14 


#MCNF-S1 


124 


258 


139 


166 


107 


174 


81 


47 


Time 


78.29 


362.43 


64.33 


441.00 


64.82 


115.35 


57.63 


38.22 










Grid networks 










gridl 


grid2 


grid3 


grid4 


grid5 


grid6 


grid7 


grid8 


[^52! 


1396 


4480 


8595 


7668 


16157 


12532 


26300 


32040 


z 


1418 


4515 


8663 


7774 


16424 


12684 


26518 


32392 


G{%) 


1.58 


0.78 


0.79 


1.38 


1.65 


1.21 


0.83 


1.10 


z 


1406 


4497 


8622 


7720 


16323 


12607 


26404 


32229 


G{%) 


0.72 


0.38 


0.31 


0.68 


1.03 


0.60 


0.40 


0.60 


#MCNF-S1 


89 


106 


252 


258 


826 


395 


923 


1157 


Time 


3.79 


36.01 


230.14 


635.89 


11421.66 


3433.22 


29879.91 


117907.94 






Canad networks 












canl 


can2 


can3 


can4 


can5 


can6 






r^s 2 i 


313 


1931 


2117 


864 


4455 


4168 






z 


362 


2019 


2434 


1023 


4834 


4941 






G{%) 


15.65 


4.56 


14.98 


18.40 


8.51 


18.55 






z 


345 


1978 


2270 


973 


4728 


4705 






G[%) 


10.22 


2.43 


7.23 


12.62 


6.13 


12.88 






#MCNF-S1 


67 


244 


525 


282 


1048 


1313 






Time 


4.03 


45.66 


265.22 


221.62 


2723.95 


7140.34 







Table 2: Computational results for MCNF-S2. 
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8. Conclusions and Future Work 

In this paper we extend the basic MCNF model to incorporate side constraints on paths, motivated 
by their practical applications in communication networks. Two extended models with different 
objective functions are presented. Their relations are studied. We show that MCNF-Sl can be 
efficiently solved by a column generation algorithm and describe bounding procedures for MCNF-S2 
with computational experiments. 

Generation of better bounds for MCNF-S2 requires further research. Future work will include to 
investigate whether better lower bound is available and its computation, as well as development of 
stronger and more efficient primal heuristics. 
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Optimization of Transmitting Powers and Locations of Base 
Stations in Cellular Radio Communication Systems 



Saku Hieta, Nokia Research Center 
Emanuel Melachrinoudis, Northeastern University 

Summary: An optimization model for determining optimal or near-optimal locations and trans- 

mitting powers for a fixed number of base stations in a cellular communication system is presented. 
Traffic demand is assumed to have a stationary non-uniform distribution and is modeled using a 
demand grid. Demand is allocated to the base station receiving the strongest signal. Physical re- 
strictions to the location of base stations, such as rivers, landmark buildings and parks are excluded 
as infeasible areas. Three objectives are considered: (a) maximization of covered traffic demand de- 
fined in terms of signal strength and co-channel interference, (b) minimization of blocking probability 
and (c) minimization of total cost. These are subsequently combined in a non-linear and non-smooth 
goal programming objective function which is optimized using the Nelder-Mead down-hill simplex al- 
gorithm. The model is implemented using Matlab and illustrated in a fictitious microcellular system 
for the Boston downtown area. 

1. Introduction 

1.1 Cellular Communication Systems 

The basic elements of a cellular communication system are mobile stations, base stations and mobile 
switching centers. A mobile station, usually a hand-held mobile telephone, is the interface between 
a user and the communication system. Base stations are fixed servers that provide access to the sys- 
tem for mobile stations located within their coverage area. The base stations consist of transmitters, 
receivers and antennas for radio communication and a control unit responsible for the control, mon- 
itoring and supervision of the traffic. The mobile switching centers (MSCs) control the system and 
in mobile telephone systems they provide the connection to the public switched telephone network. 
Base stations are connected to MSCs with cables or microwave links. 

The capacity of cellular systems is based on the concept of frequency re-use: same radio channels 
may be re-used in several small geographic areas, called cells, each served by one base station. This 
is possible by dividing the available channels to subsets, that are assigned to cells located sufficiently 
apart from each other to avoid interference. The cells that use the same subset of channels are 
called co-channel cells and the resulting interference is called co-channel interference. In theory, the 
capacity of a cellular system can be increased indefinitely by using smaller cells and this scalability 
has made it possible to accommodate the fast growing demand for mobile communication services. 
In present cellular systems, a mobile station monitors signal level received from surrounding base 
stations and when a call is initiated, the system designates a channel to establish a communication 
link between the mobile station and the base station providing the strongest signal. If there are no 
available channels, the call is blocked and the caller will hear a busy signal. During an on-going call, 
a mobile station might move to a position where the signal provided by the base station is no longer 
strong enough to guarantee adequate voice quality. If another base station can provide stronger 
signal, the channel and base station assigned to the mobile station are changed in an action called 
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hand-ofF. In case that there are no available channels for hand-off or any base station cannot provide 
sufficient signal level, the call is dropped, i.e. abruptly terminated. 

1.2 Cellular System Planning 

The objectives of cellular system planning are to maximize covered demand within the service area, 
to minimize blocking probability and dropping probability of calls made within the area and to 
minimize the total cost of building and operating the system. Coverage can be defined in terms 
of signal strength only or in terms of both signal strength and interference. Previous research on 
optimization of base station locations has either concentrated on signal strength only ([1], [2], [3]) or 
simplifying assumptions on channel assignment have been made to account for interference ([4], [5]). 
The model presented in this paper includes a channel assignment heuristic, which allows realistic 
evaluation of coverage in terms of both signal strength and interference. 

Evaluation of blocking probability requires information on demand distribution and channel assign- 
ment. Using the above mentioned channel assignment heuristic, it is possible to evaluate also the 
blocking probability. This is a distinctive feature of the model proposed in this paper, the authors 
are not aware of any previous research including such a feature. Dropping probability is not included 
in the presented model, as it would require a more detailed model of the demand than is used here. 
Cost minimization has been used as an objective by some authors ([6], [7]), but it has not been com- 
bined with system performance objectives. This paper describes an integrated approach, including 
both the engineering aspects of the problem as well as its economical aspects. 



2. Mathematical Model 

This section describes a mathematical model for evaluating cellular system performance in terms of 
coverage, blocking probability and system cost. First, some necessary definitions are made and then 
an algorithm for channel assignment and performance evaluation is presented. 



2.1 Definitions 



Service area is defined as A = {(x,y) e I 0 < a; < Xmax^O < y < ymax}- Traffic demand within 
the service area is modeled with a grid of demand points (i, j), i = 1, . • . , Wx? J = U • • • ? jmax, with 
coordinate locations 



{{^ijjUij) ^ ^ I ^ij ~~ 





Umax } • 



Associated with each demand point (i,j) is the estimated busy-hour traffic 



where is the mean arrival rate of calls and /Zy is the mean holding time of calls. Parameters 
and Hij are assumed to be independently defined for each grid point, neglecting the mobility of the 
demand. Calls are assumed to arrive according to Poisson process, but the holding time distribution 
can be arbitrary. 
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The number of base stations to be located in the service area is constant, denoted by n. Base station 
k is described by vector 

(xk,yk,Pk) e D,k = l,...,n, 

where (xk.Vk) € A are the coordinates of base station k and pk € [0,Pmax] is its transmitting power. 
The coordinate values are not restricted to points on the demand grid. Additional restrictions to the 
design space D are modeled by defining the availability matrix A = [a(i,j)], where 



aihj) 



1 if base stations can be located in grid cell (z, j) 
0 otherwise 



Frequency channels are denoted by I = 1, . . . , m, where m is the number of channels in the system. 
Each channel can be used by several base stations, but only once by a given base station. Channel 
assignment is defined with variable '‘^kh such that 



Ukl 



1 if channel I is assigned to base station k 
0 otherwise 



The strength of the signal transmitted by base station k and received at demand point (z, j) is 
denoted by , and calculated using the exponential pathloss model [8] 



s 



k 

ij 




if d < do 
ifd>do ’ 



(1) 



where d = yj{xk — Xij)'^ + {vk + Vij)^ is the distance between base station k and demand point (z, j), 
Po is the received power at reference distance do and i/ is pathloss exponent. 

For each demand point (z, j) a demand allocation variable Vij indicates the base station to which the 
demand point is allocated: 



if Sij < Slim for all A; = 1, . . . , n 






( 2 ) 



where sum is the signal strength threshold, required for adequate voice quality. The signal strength 
received from the allocated base station is then denoted by 



1 0 



for {(t, j) I Vij > 0} 
for I Vij = 0} 



( 3 ) 



This simple base station allocation rule does not take into account the mobility of users or the 
possibility of splitting the traffic of a demand point to multiple base stations. 

Given channel assignments Uki and signal strengths and co-channel interference on channel I 
at demand point (z, j) can be calculated as the worst case signal to interference ratio: 









E(4)+^’ 

ke^ 



( 4 ) 
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where ^ = {k \uki = 1 and k ^ } and e is a very small number, used for avoiding division by zero 

in numerical calculations. Total co-channel interference at demand point (ij) is defined as 

(5) 

This is a worst case approximation, because it is possible that a mobile station at demand point (i,j) 
is actually using a channel with less than maximum interference. A co-channel interference threshold 
ipiim is required for adequate voice quality. 

In order to calculate coverage, covered set C is defined as 



C = {(i, j) I Vij > 0 and (pij > (plim}. 

Coverage can then be defined as the percentage of demand that lies in covered demand points: 

Pij 

(iJ)eC 

® • 

Z. Pij 

all{ij) 

Blocking probability bk is calculated using the recursive version of the Erlang-B formula [9]: 



( 6 ) 



bk(0) = 1, 

bkirrik) = 



bkjmk - 1) 



-, for rrik > 0, 



mk/pk-^bk(mk-iy 

where ruk = '^ki is the number of channels assigned to base station k and 



(7) 



Pk= X! Pij (8) 

is the traffic demand assigned to base station k (in erlangs). System performance in terms of blocking 
probability will be measured as the maximum blocking probability among all base stations : 



bmax — bk- (9) 

The channel assignment heuristic uses the attainment of a blocking probability threshold as a ter- 
minating condition. This threshold will be denoted by bum- 
The total cost of a given system configuration is calculated using the cost model 

C= 53 (c6 + c,(x/fe,y*) + Cem* + (^p*), (10) 

k—l,...,n 

where Cb is fixed cost of a base station, Ca is cost of base station site {xk, yk)^ Cc is cost of a frequency 
channel and Cp is cost of power. All costs should include both investment and operational expenses 
and they should be annualized appropriately. The site cost is defined using matrix representation 
Cg = [cs{ij)], using the same grid as for demand. The cost of a base station is determined based on 
the nearest grid point. 
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2.2 Channel Allocation and Performance Evaluation Algorithm 



The following algorithm determines channel allocation and evaluates system performance: 

STEP 1 Calculate all signal strengths and determine demand allocation 

For each A; = 1, . . . ,n and each = 1, . . -jimaxJ = 1, • • • Jmax 

Calculate from Equation 1 
Calculate Vij from Equation 2 

STEP 2 Calculate signal strength from assigned base station and traffic demands for stations 
For each {i,j),i = 1, . . • , W, j = 1, • • • ,jmax 
Calculate from Equation 3 
For each k = 1, . . . , n 
Calculate pk from Equation 8 

STEP 3 Determine the set of candidate base stations for next channel assignment 
Determine = {A; | 6* > bum and nik < m] 

If K is an empty set, goto STEP 9, else goto STEP 4. 

STEP 4 Select base station for next channel assignment 
Set k* = arg max bk 

^ keK 

STEP 5 Determine the set of candidate channels for assignment 
L = {1\ Uk\i = 0 } 



STEP 6 Calculate co-channel interference for candidate channels 

For each I G L and each (i, j) 

Calculate (p\j assuming Uk*,i = 1, using Equation 4 

Calculate ipij assuming Uk*,i = 1, using Equation 5 

STEP 7 Calculate co-channel interference ratio for candidate channels 

Determine interfered set ^ \ (fij < ipiim] 

Calculate ratio ^ of demand pointsji, j) € 

^ total number of demand points 

STEP 8 Select channel for assignment and update blocking probability 

Set I* = arg min ib 

® leL ^ 

Set Uk*,i = 1 
Set TTijfc* = ruk* + 1 



Calculate bk* from Equation 7 
Goto STEP 3 



STEP 9 Calculate final values of performance measures a, bmdx and c 
For each / = 1, . . . , m and each (z, j) 

Calculate from Equation 4 
Calculate ipij from Equation 5 
Determine covered set C 
Calculate a from Equation 6 
Calculate bmax from Equation 9 
Calculate c from Equation refeq:c 
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3. Optimization 

3.1 Problem Formulation 

Using the mathematical model presented in the previous section, cellular system planning problem 
can be described as a multiple objective optimization problem 



max Zi 
min Z 2 
min zz 
subject to 



a{{xi,yi,Pl), ■ ■ ■ ,{Xn,yn,Pn)) 

^max{{Xiy yi,Pl), . . . , (Xn, J/niPn)) 

i{xi,yi,Pl),---,{Xn,yn,Pn)) € D. 



Taking a goal-attainment approach to multiple objective optimization, we define target values U for 
the performance measures Zi and define normalized deviations d, for them: 

. fr, h-zi. 

di = max{0,— 
di = 

da = max{0,^^}, 

where z“^^^ = = 1 and z^^^ = n(cb H- max Cs{iJ) -h CcTU + CpPmax) are the maximum values of 

aH{iJ) 

the performance measures. 

The problem is now formulated as a single objective nonlinear program with the weighted sum of 
deviations as the object ve function 



min 2 : = u;idi -f W 2 d ,2 + 

subject to ((a;i, 2 /i,pi),...,(xn, 2 /n,Pn)) € A 

where Wi € [0, 1] are preference weights representing the relative importance of attaining the cor- 
responding goal. Finally, in order to use an optimization algorithm suitable for non- constrained 
problems, the problem is formulated as 



minz = Widi-\- W2d2 + w^ds -\-Wfdf, 

where ly/ > 0 is a feasibility weight and df is the deviation function 

df=l^ if ((®l>J'l>Pl).----(a;n>yn,Pn)) e 
[ 1 otherwise. 

3.2 Optimization Algorithm 

Based on previous research ([2], [4]), one of the most effective and robust methods for optimization 
of the type of problem at hand is the Nelder-Mead downhill simplex method [10]. It is a direct 
search method, making use of the geometrical concept of simplex. A simplex in n dimensions is a 
polyhedron consisting of n + 1 vertices. In two dimensions, simplex is a triangle, in three dimensions 
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it is a tetrahedron. In Nelder-Mead algorithm, an initial simplex is first generated in the design space 
of the problem. At each iteration, the values of the objective function at the vertices of the simplex 
are evaluated, and the simplex is then transformed using heuristic refiection, contraction, expansion 
and shrinking steps. 

The Nelder-Mead algorithm will always converge to a local optimum, but the quality of the solution 
and the rate of convergence depend on choice of the initial simplex, on target values for performance 
measures, on preference and penalty weights and on parameters of Nelder-Mead algorithm. The 
computational example of next section demonstrates how these parameters can be chosen. 

4. Computational Example 

4.1 Input Data 

The optimization model introduced in the two preceding sections was tested using a fictions cellular 
system for Boston, Massachusetts, as an example. The downtown area of Boston covers a peninsula 
with dimensions of approximately 1800x2400 meters. The total traffic demand from this area during 
busy hour was assumed to be 2400 erlangs. The gridsize of the demand grid was chosen to be 120 
meters, giving a grid of 15x20 gridcells. The gridcells were classified according to the expected traffic 
demand, ranging from 0 to 20 erlangs per gridcell. 

The gridcells were also classified according to the cost of cell sites. The site cost was assumed to 
be lowest in highway cells, where basest ation antennas can be attached to traffic signs and such 
structures. The highest cost was assumed for commercial areas where the cost of roof space is 
limited. Some gridcells, such as all water cells and parks were considered unavailable. The input 
values for design space dimensions, number of frequency channels, threshold values for signal strength, 
co-channel interference and blocking probability and cost parameter values are shown in Table 1. 
The initial solution was chosen in an intuitive manner, placing base stations in locations where the 
surrounding gridcells had high traffic demand. Trial runs were then performed to compare a few 
alternatives created this way. The number of base stations was chosen to be 14 because the trial 
runs indicated that this number was required to achieve 95% coverage. 

The preference weights were all set to 0.33 because all of them were assumed equally important. 
The feasibility weight was also set to 0.33 to allow the optimization to ’’move through” infeasible 
areas. Target values for demand coverage and cost were set to 100% and 1.4 Million USD, i.e. the 
best achievable values. Target value t2 for blocking probability was set to 2%, the same value as the 
blocking probability threshold bum oi channel assignment algorithm, since the algorithm will force 
blocking probability below the threshold if there are enough channels, but it will not continue any 
further. Range equalization factors were set according to the full range of attainable values of the 
objectives. 

4.2 Optimization Results 

Based on the trial runs it was clear that a single run of Nelder-Mead algorithm was not enough to 
bring the solution near the global optimum. This tendency of the algorithm to converge fast to a local 
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Input 


Unit 


Value 


Xmax 


m 


2400 


ymax 


m 


1800 


Pmax 


W 


100 


m 


- 


800 


^lim 


dBm 


-100 


V^lim 


dB 


9 


^lim 


% 


2 


Cb 


USD/5 yrs 


100000 


Cc 


USD/5 yrs 


12.5 


Cp 


USD/5 yrs 


12.5 



Table 1: Input values 



Run 




z 


a 


IMl 




Initial 


- 




0.76 


0.017 




1 


734 


0.1100 




0.017 


5861100 


2 


534 






IjQQI 




3 


628 










4 


579 










5 


677 


0.0987 


0.97 




5750800 


6 


965 


EES9 


0.97 




5750800 


7 


536 




0.97 




5750900 


8 


394 




Blgl 




5750900 


9 


591 


E!Bi 






5750900 



Table 2: Improvement achieved by restarting Nelder-Mead algorithm 



optimum is caused by the shrinking step, that decreases the simplex when improving vertices cannot 
be found by reflection or contraction. Shrinking is unavoidable because the guaranteed convergence 
of the algorithm is based on the shrinking step. A way to improve the solution of a single run was 
found by starting the Nelder-Mead again using the final solution from the previous run as the initial 
solution for the next one. This means that a new ’’large” initial simplex is constructed and new 
improving vertices can be found by reflection and contraction steps, unless a global optimum has 
been reached. To further improve the performance of the new run, the initial simplex was created by 
using pseudo-random number generation and changing the seed of the generator for each new run. 
Restarting the Nelder-Mead algorithm from the final solution with a new initial simplex for each run 
improved the optimization results considerably. Table 2 shows the improvement during 12 runs for 
the case with 14 base stations. 



4.3 Sensitivity Analysis 

Since the optimization results depend on the chosen initial parameters, sensitivity analysis was 
performed to determine the effect of changing some of these parameters. The effect of the number of 
base stations was studied by performing optimization runs with restarting with 7 to 15 base stations. 
A very clear correlation between the number of base stations, demand coverage and total cost was 
found as expected. 
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The effect of different initial solutions was also tested, and the results showed that the choice of a good 
initial solution is crucial in order to achieve good final solution. However, the optimization almost 
always resulted in considerable improvements to the initial solution, proving it to be a valuable tool. 
Setting the preference weights was found to be maybe the most difficult task in using the algorithm. 
Because of the non-convexity of the problem, a seemingly logical setting of preference weights can 
result in worse solutions than arbitrarily chosen weights. Further research is needed to determine if 
an effective strategy for setting the weights exists and whether preference weights should be used at 
all. 

Sensitivity analysis was also performed regarding the target values for performance measures. The 
results were mostly intuitively acceptable, relaxing targets in one of the objectives usually improved 
the achieved peformance in other objectives. However, non-convexity caused some surprises also 
regarding the target values. All of the results clearly demonstrated the importance of performing 
sensitivity analysis. 

5. Conclusions 

In this paper we have introduced a mathematical model, that allows concurrent evaluation and 
optimization of coverage (including co-channel interference), blocking probability and cost of a cel- 
lular system. This is a considerable improvement to the previous research, where only one or two 
performance measures have been considered simultaneously. 

The introduced method for concurrent performance evaluation should be seen as a framework, that 
can be further improved by using more advanced channel assignment algorithms and more detailed 
models for traffic demand, radio propagation and cost assessment. 

The non-linear goal attainment approach to multiple criteria optimization was found to be suitable to 
cellular system planning. However, the practicality of the approach needs to be tested in real world 
situations. Especially the determination of preference weights and target values of the objective 
function, and sensitivity analysis of the achieved solution need to be studied further. 

Nelder-Mead down-hill simplex algorithm proved to be a good choice for this problem, as previous 
research suggested. Experimentations showed that the optimization yields intuitively acceptable 
results with reasonable computational effort. However, without comparable results from other meth- 
ods, it is difficult to evaluate the efficiency of the optimization. Further research is needed to provide 
such comparisons and to further develop the optimization, for example by using simulated annealing. 
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A Ring Network Design Problem 



M. Henningsson, K. Holmberg, M. Ronnqvist, P. Varbrand, Linkoping University 

Summary: We discuss the problem of designing a telecommunication network with the survivability 
requirement that the network should be composed of connected rings of links. The network design 
problem is then to choose links from a given network, and compose them into a number of rings. 
Furthermore, the rings should be connected at certain transit nodes. The traffic between rings may 
pass through other rings. Each ring is associated with a certain fixed cost. We describe the problem, 
modeled as a linear integer programming problem, and a heuristic solution method, based on column 
generation and Lagrangean relaxation. 

Key Words: Network design, rings, column generation, Lagrangean relaxation, heuristic. 
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Telia AB. 

1. Introduction 

The task of designing new networks within the area of telecommunications is today a very frequently 
encountered and important problem. With the increased usage of Internet and multimedia, the new 
networks are required to have both high capacity and high survivability. One way to create a suitable 
network is to base it on a ring structure. In a ring there are always two ways to send traffic, clockwise 
or counter-clockwise. This means that traffic can be sent even if a single link or node breaks down, by 
using the opposite direction of the ring. The rings are so called Self Healing Rings (SHR) where some 
capacity is unused until the event of a link or node failure. The technology is based on Synchronous 
Digital Hierarchy (SDH) standard for optical transmission. 

In this paper, we will discuss a situation where demand should be sent across a network which only 
consists of rings. OADM (Optical Add Drop Multiplexor) equipment are used to add (or drop) 
demand to (from) the rings, and also to switch demand from one ring to another. 

The rings are constructed from links in an existing network, and the task is to decide the topological 
design of the rings. A commodity is defined by a pair of demand nodes, and must be sent along one 
path. If the path includes more than one ring, the commodity must be switched from one ring to 
another at a so called transit node. The capacity required on each of the fibers included in the ring 
is equal to the total sum of traffic on the ring, and independent of the routing in the rings. 

The cost of a ring depends on its length and capacity. The cost for an OADM equipment is fixed and 
occurs whenever a ring is connected to a demand node or a transit node. The task is to determine 
the ring structure, the routing strategy for all commodities, and where to install OADM equipment. 
We have not found this problem in the literature, but related problems can be found in [9], [7], [3], [8], 
and [10]. 

2. Mathematical Formulation 

We have a given network with a set of nodes, i = 1, . . . ,N, and a set of undirected links (i,j) (i < j), 
with length lij. For each commodity. A: = 1, . . . , C, we have an origin o(k), a destination d(k) and a 
demand 6^. Let Cf = {k : o{k) = %} and = {k : d{k) = i}. There are T transit nodes, numbered 
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from 1 to T. We will use index I for transit nodes, and index i for nodes in general. Note that transit 
nodes usually have a demand. 

A ring is a sequence of links, where each ring must not use any link or pass any node more than 
once. The total length of a ring must not exceed L, and the number of nodes connected to a ring 
must not exceed N^. We will initially assume that all feasible rings are known a priori, indexed by 
r = 1, . . . , i?. The topology of a ring is defined by the following coefficients. 

air = 1 if ring r passes node i, 0 otherwise 

6ijr = 1 if link (i,j) is included in ring r, 0 otherwise 

Each ring has the capacity f/, and a fixed cost, consisting of the sum of costs for the links included 

in the ring. The total fixed cost for installing capacity U is 

fr = ^2 

(*J) 

where and and are constants. Several identical rings may be placed upon each 

other, yielding a staircase cost structure. However, in long term planning the design must be stable 
with respect to changes in demand and capacity. Therefore, it is more suitable to approximate the 
staircase costs with one fixed cost plus a linear cost depending on the capacity. The linear capacity 
costs depend on the length of the ring, and the coefficients are given as = E(x,j 7 ) where 

cf. = uffj, u = v' I U and u' is a factor, less than one, reflecting the advantage of building several 
identical rings. Note that the capacity of a ring is no longer limited to U, since a ring in our model 
could consist of any number of identical rings. 

In addition to ring costs, there are fixed costs, c^, for the equipment (OADM) obtained when 
connecting a certain node to a certain ring. Commodities sharing the same origin or destination can 
use the same equipment. Finally there is a linear cost, c^, for all traffic leaving/entering a ring. 
Since the total demand is known, the total infiow/outflow at demand nodes is constant. Therefore 
only transit flows affect these costs. 

2.1 A Model with Given Rings 

We use the following variables. 

yr — ^ if ring r is used, 0 otherwise 

= 1 if o{k) and d{k) both are connected to ring r, 0 otherwise 
xlk =1 if only o{k) is connected to ring r, 0 otherwise 
xjf = 1 if only d{k) is connected to ring r, 0 otherwise 
Xrk = 1 if commodity k uses ring r as a transit ring, 0 otherwise 
hir = 1 if node i is connected to ring r, 0 otherwise 

= 1 if commodity k leaves ring r at transit node /, 0 otherwise 
= 1 if commodity k enters ring r at transit node /, 0 otherwise 
The mathematical formulation is given below. 

min Y. frVr + Y, + 4k + 4k + 4k) + <4 hir + Y 

r r,k t,r l,k,r 
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\/k 
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Constraints 1 are the demand constraints, and ensure that the correct amount is sent between the 
origin and the destination. Constraints 2 and 3 control transition between rings. Constraints 4 
ensure that a fixed cost is paid when using a ring. Constraints 5 and 6 ensure that fixed costs for 
node connection equipment are paid. Constraints 7 limits the number of nodes connected to a ring, 
and constraints 8 ensure that no connections or transit are possible for rings not used. The model 
has 6RC + NR 4- R variables and AC R-\- IbRC + NR constraints for T = 2 (which is the case of 
our main interest). 

In order to strictly enforce that each ring should be connected to at least P transit nodes, we can 
add the following constraints (which however, in our examples with P = 1, seem to be redundant). 
Y^hir>P Vr (10) 

One might also require that the network should be 2-node-connected. For the case with two transit 
nodes, we set P = 2, which means that both transit nodes must be connected to each ring. 

2.2 Column Generation Aspects 

The main effect of the coefficients a is to eliminate variables that may not be nonzero (for example, 
one may not connect a certain node i to a certain ring r if the ring does not pass that node, Oir = 0). If 
all rings are not known/included in the model, one might consider the traditional column generation 
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approach, where one is looking for “columns” not included, but with negative reduced cost. ^From 
this viewpoint, all feasible rings are considered to be numbered a priori, and are not changed. The 
question is which rings to include, i.e. for which r we may set yr = 1. This means that the a- and 
e-coefiicients are completely fixed, and that we will include (and possibly exclude) different sets of 
such coefficients, and then decide if to use them with the help of the variables x, w and h. 
Another possible viewpoint is that the rings are not constant, but may change. In this case, we do 
not assume an a priori numbering of the rings. Instead we include in the model a fixed number of 
rings, numbered from 1 to R, where R is an upper bound on the number of rings that is used in the 
optimal solution. Note that R R. Also here yr determines if ring r should be used or not, but 
ring r is not defined a priori. Instead it may be changed to suit our purposes, simply by considering 
a (and e) to be variables rather than constants. 

We concentrate the influence of a to the y- variables, which is suitable from a column generation point 
of view, since the items that can be generated are the rings, which correspond to the i/- variables more 
than to any other variables in the model. Note that we are still planning to solve the model for fixed 
values of a and e. We then need a separate model for updating the values of a and e, with the 
support of dual information. 



2.3 Ring Generation 



Assuming that all rings are known is not very realistic for large scale applications. Therefore one 
must be able to generate rings. Each ring should form a cycle, so if the nodes to be included were 
fixed, it would be a Traveling Salesman Problem (TSP). However, instead there is a reward from 
including a certain node, which yields a Price Collecting TSP- type of problem, PCTSP, see [2], [4]. 
The variables in this problem are the coefficients a and e. The objective function, to be specified 
later, consists of one part with costs for the links, depending on the length of the links, and one 
part with rewards for visiting nodes, Tti (< 0). The index r is here the number of the ring that is to 
be constructed, and can be omitted when solving the problem. 
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(RGP) 



Constraints 11, 12 and 13 are the corresponding TSP constraints, while constraint 14 states that 
each ring must be connected to at least one of the transit nodes and constraint 15 enforces a maximal 
length, L, of the ring. The optimal solution to the ring generation problem yields a new ring, (a^, e^.). 
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to be included in the main model. 

There are several possible solution approaches for this problem. Without constraint 15, it can be 
reformulated as an ordinary asymmetric TSP with twice as many nodes. Therefore a natural approach 
is to apply Lagrangean relaxation to the single constraint 15, [1]. (In our application constraint 15 is 
often redundant.) We can also use other relaxations of the types that are used for ordinary TSP:s, 
as for example the well known assignment problem relaxation or the 1-tree relaxation. We will also 
consider heuristic solution approaches for this problem. 

3. Column Generation Procedures 

We will now discuss a (heuristic) column generation approach for the problem. We first specify the 
information to be used when generating new columns. What is a “column” that we may generate? 
How can we get reduced costs that gives the best column? Since we already have considered the ring 
generation problem, RGP, a natural answer to the first question is that a column corresponds to a 
ring. There are two possible approaches for calculating reduced costs, either based on LP-duality or 
based on Lagrangean duality. 

A complication is the dependence between the different variables. If a certain ring is not used, there 
can be no node connections to this ring and there can be no transit to or from it. These variable 
dependencies are easier to handle in a Lagrangean approach, since there is a possibility of dualizing 
only some of the constraints, while other constraints may be kept. Therefore, we will not use the LP 
approach. 

3.1 Reduced Costs Based on Lagrangean Duality 

The general approach is to construct a Lagrangean relaxation of the model, by assuming fixed 
Lagrange multipliers (dual variables), and then look for the most negative reduced cost. Then we 
consider a and e variable, and find the values that yield the best reduced cost, i.e. the best ring. 
The use of Lagrangean duality enables us to keep all constraints that prohibits any usage of ring r 
as long as 2 /r = 0 ) so we can concentrate the total effect on the variable In other words, for ring 
r we have the choice of letting Vr = 0 (not using ring r at all and not obtaining any gains from it) 
or letting yr = L which infiicts the fixed cost /r, but also enables the usage of ring r. What remains 
then is the possibly non-trivial decision of how to use ring r in the best way, or rather to determine 
the total gain of including ring r. This question is obviously complicated by the fact that there are 
many different commodities that might benefit from ring r, in different ways. 

We will relax all constraints that connects different rings, so that the subproblem is separable into 
one trivial problem for each ring r. We relax constraints 1 , 2, 3, 5, 6 and 8 . Constraints 7 are 
modified to Yli Kr < Vr, and we add the constraints + y^?kr ^ 2/r V/, A;, r. We use the 

following Lagrange multipliers: a for constraints 1, p for constraints 2, 7 for constraints 3, a for 
constraints 5 , r for constraints 6 , and ^ for constraints 8 . Details of the subproblem and its solution 
can be found in [5]. 

If ring r is not used, we have y^ = 0, but also = xjf = = 0 VA;, hir = 0 Vz and 

= wflj. = 0 V/, A:. If ring r is used, we set yr = 1, which enables the above x-, h- and it;- variables 
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to be greater than zero. The total effect of all these variables can be combined into a reduced cost 
on the variable yr- 

The subproblem for ring r is solved by temporary assuming that yr is fixed, and calculating the 
best X-, h- and ly-solutions and the corresponding objective function value, Vr{yr)- For yr = I, the 



optimal objective function value will be 

/r = /r + E 0} ■ 



■ E min{4. 0} + E E cJJ?, 0} 

te/r I k 



fr = ft - Ei(Ejfc6C.P ^r* + E*6CPCr*)«ir “ Ei E*(ffrfc + ^Irkhlr ~ Ei^tyOir, ~ filk ~ 

firk + Ei«^|*r + Elf/rt. ^k° = + «* “ Plk + ~ 0^k + 

^k = C?h+al + al + (7l+al+^ll,-\-^^^, = C^&t + ,S4-7lfc + 7|tr+e,*r. cS? = ^rfc + 7i*+mr + f,\r 

4 = c'^-(E*ecp«^*r+Et6c°<^lr)+^frfor* >^.and4 = C^-E(b€CP<^kr+Efc6C/’<^i)-E*n*r+?^ 
for i <T. 



Here we can call fr the reduced cost of j/r, and to solve subproblem r, it only remains to solve 

min/r2/rS.t. ?/r€{0, 1} 

which has the optimal solution = 1 if /^ < 0 and 0 if fr > 0. In the column generation approach 
we assume that we have a certain subset of all possible rings included in the problem. We are then 
looking for new, better rings to include. The best ring (locally) is the one with the most negative 
reduced cost. If we cannot find any ring with negative reduced cost, we can deduce that the current 
set of rings is LP-optimal. Thus we wish to find the (so far unknown) ring with min/^. In order to 

_ r 

do this, we must explicitly specify how fr depends on a and e. 



/r = E + E 

(ij) * 



where t;^(e) = EH“{(E + ^rk> 0}) 

k (ij) 

and the coefficients before a are 

= - E Ck - E 4* - E4* - E4. - 4 

kecP keCP k k 

where the and ^^-terms are omitted for i > T, and the constants are 

Kf, = -PI, - = ci- Ph + 4r + erk. - Ph + <^1 + 

^rk = ^IJki = KJ^kr = /^r/fc+T/fc+mr+^/ir^ ^ir = 

~ T^keCP ^kr ~ T^keCP ^kr ~ Sfc +^tV for Z < T, = C^ — Y^keCP ^kr ~ T>keCP for 2 > T, 

= El Ek{mm{Krd, , 0}) + min{i^^, 0} and Kf, = mm{Kfk, Kf,}. 



The constant Kf, indicates the possible gain (with reversed sign) of x^, on ring r, i.e. the possible 
gain of using ring r as transit ring for commodity k. In order to use ring r for this, it is not only 
necessary that Kf, is negative, but, as the min-operator indicates, adding the linear cost of this 
operation (which is the demand bk times the unit length cost of the ring), the result must still be 
negative. The same interpretation can be made for the other constants. The collected gain related 
to the a;- variables is denoted by u^(e), which is a non-differentiable function of e. Furthermore, 
indicates the total possible gain of transit and node connection of ring r. 

Analyzing the expression for the reduced cost, /r, we find that the first term represents the price for 
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the fixed costs of the ring, while the second term represents the possible gain of including each node 
in each ring. The next term, u^(e), containing the min-operator, includes one part representing the 
price for the linear costs of the ring and one part representing the gain of using ring r for commodity k 
(the gain of x). Finally there is a constant term representing the gains of transit and node connection 
{w and h). 

The variable expressions inside the min-operators depend on e in exactly the same way, and not at 
all on a. So the only question that remains is how many times the term ^ij^ijr will appear, 
i.e. how large the weight will be on this expression, as compared to the other terms. This question 
actually concerns the balance between the linear costs and the fixed costs. 

The size of the weight is clearly depending on the result of the minimization, and we can here only 
make some estimations. First, letting = {k \ < 0}, we find that all terms in v^{e) with k ^ Jr 

will be zero. 

Then there is a certain ring r, with fixed a and e, that was used when solving PI, i.e. when calculating 
the dual variables (Lagrange multipliers). Let Jj. = {k : (E(t,j) < 0}, (which is the 

set of commodities that actually benefits from ring r when e = e). Then clearly 

= II ((H C^eijr)bn + K^^,) and /r = H ff-Cijr + H + OriY, 

keJr (t,j) (*j) * ihj) 

where 6r = ^keJr which is the total demand of all the commodities in Jr, i.e. that gain from the 
ring at e, and = Eibc/r ^rk + This is a linearization of v^(e) around e. 

However, if e is changed, the weight of the linear costs will be less or equal to 6r = ^keJr ^k^ which 
is the total demand of all the commodities in Jr, i.e. the commodities that might gain from the ring, 
and greater or equal to 0. (Also the constant will change, but that will not influence the values 

of a and e.) We will then get 

fr ~ hj^ijr “I" ^ir^ir "b ^r 

(iJ) * 

where kj = + 9rC^ = (1 + ? and Or might vary between 0 and Or. Note that the coefficients 

for a will not vary. Omitting all constants, we thus have an objective function for RGP, with one 
parameter. Or, for evaluating changes of ring r. 

The objective function above is well suited, locally, for evaluating changes in the length cost of the 
ring by changing e, via the coefficients lij, and increases of a, via the coefficients tTi^. However, it is 
not suited for evaluating a decrease of a, since if air = 1? then ^ = 0 for this i and r, which leads 
to itir = 0. However, it is clear that the gain of connecting a certain node to a certain ring might 
depend strongly on which nodes that are already connected to the ring. 

We estimate the effect of decreasing a certain air from 1 to 0, by using both primal and dual 
information. If air is decreased from 1 to 0, constraints 8 tell us that a number of corresponding 
primal variables must be zero, and the loss occurs if some of these variables were nonzero before the 
change. This information is directly available from the /("-constants. Specifically, if air = 0 then 
xl^ and x%^ must be zero for all k 6 Cf, xjif and must be zero for all k e C^, and hir must 
be zero. Furthermore, if z < T, then x^j^ must be zero for all k, w-^r must be zero for all k, and 
must be zero for all k. This means that if any of these variables gave negative contributions via 
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the corresponding A'-constants, these contributions are eliminated. We can thus calculate the loss 
coefficients Wir for all i, r such that a»r = 1- For more details see [5]. 

3.2 Obtaining Integer Solutions 

Column generation is a technique for LP-problems. In principle the integer problem could be solved 
by branch-and-bound, by using the integer version of column generation, sometimes called branch- 
and-price. Due to the difficulty of the problem, it might be unrealistic to expect to be able to find 
the exact optimal solution (and verify it) within a reasonable time. In such a case one can focus on 
heuristic techniques for generating feasible integer solutions. We will develop a local search heuristic 
that eliminate rings with little flow, by routing it in other existing rings, and expanding short rings. 
Assuming that the cost for an OADM is higher than the costs for transit and capacity on the rings, 
a heuristic idea is to decrease the number of connections, so that each node is connected to only one 
ring. This can be done using shortest path problems. 

Another way of finding feasible rings is to view it as a Multi Depot Vehicle Routing Problem (MVRP), 
see [6]. We then try to find routes (rings) starting at the depots (transit nodes) where each node has 
a demand on one unit. The capacity on each vehicle is the same as the maximum number of nodes 
that a ring could be connected to. We can then use heuristics designed for this problem. 

3.3 Solution Strategy 

We now present a solution method where we generate new rings based on the reduced costs described 
earlier. The method is described below. 

1 . Generate a feasible set of rings (possibly larger than needed) , so that a feasible solution can be 
constructed. 

2. Solve the LP-relaxation of model PI in order to get dual variables. 

3. Calculate and ^ir. 

4. Generate a new ring. This can be done by using a heuristic a) or a more exact optimization 
technique b). 

a) Increase a ring with new nodes by considering ^ir'. Find a itir <C 0. Find the shortest path 
from node i to ring r and back again without using the same arc or passing the same node 
twice. Calculate the reduced cost fr based on the extended ring. If Jr < 0, enlarge the 
set of rings with this new ring. Optionally, check several possibilities and choose the ring 
with the best reduced cost /r. 

Decrease a ring by considering ^ir'. Find a TTir close to zero, remove node i from the ring 
and calculate fr. Include the new ring if fr < 0. 

b) Solve the subproblem more exactly by solving a PCTSP, as described earlier, calculate 
the reduced cost /r, and include the new ring if fr < 0. 
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5. If the set of rings has been enlarged, go to 2 (and optionally remove rings that has been unused 
for several iterations), otherwise go to 6. 

6 . Find a feasible integer solution, using the LP-solution and a heuristic. 

4. A Numerical Example 

For illustrative purposes, we now discuss a small network with 10 nodes, including two transit nodes, 
and 9 commodities. We consider a situation where 6 rings are defined and included, see Figure 1 
where the rings are numbered by Roman numbers. 




Figure 1: A small network with ten nodes (including two transit nodes) and six rings. 

Nodes 3 and 7 are transit nodes, o = (1,2, 1,4, 4, 6, 3, 3, 2), d — (4,10,5,9,8,7,7,10,8) and h = 
(1,2, 1,1, 1,6, 3, 4,1). The rings have fixed costs /r=(786.8, 793.0, 761.8, 970.3, 513.2, 780.5), and 
the linear coefficients c^= (52.5, 52.9, 50.8, 64.7, 34.3, 52.0). The unit cost for transit at a transit 
node is = 20, and the cost for connecting a node to a ring is = 100. The maximum number of 
connected nodes to a ring is 7 (but a ring may pass more nodes). 

Solving PI with these rings yields the optimal objective function value 5092.6. The solution from the 
LP-relaxation is integral. The rings I, III, and V are used, i.e. = ?/3 = 2/5 = 1. Two commodities 
(2 and 9) need a transit ring (X32 = rrjg = 1), while four commodities (3, 4, 6 and 7) do not need 
any transit at all (xfg = x\^ = 2:36 = 2:37 = 1). 

We now make a heuristic attempt to generate a new ring, based on and the reduced costs Jr. We 
find the following large negative values: ^41 = —721.6, 7T22 = —1092.1, and #23 = —1092.1, which 
indicates that there seems to be a large gain in connecting node 4 to ring I, node 2 to ring II and node 
2 to ring III. Calculating the reduced cost, we get // = -525.6, fij = —712.1, and //// = -698.1. 
Also we notice that #82 = ~bl is rather close to zero and indicates that node 8 might be disconnected 
from ring II. The reduced cost is fi,= —233.5. The reduced costs indicate that the best ring to 
include is the modification of ring II, where we have included node 2. We must enlarge ring II with 
four new nodes. This new ring, called ring VII, passes the nodes 7, 8, 10, 9, 5, 2, 1, 3, 6 and back 
to the transit node 7 again. We include this ring to the set of rings and solve model PI again. The 
objective function value now decreases from 5092.6 to 5043.7, so the solution is improved. Now the 
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rings III and the new ring VII are used, while two previously used rings (I and V) are dropped. 
No transit rings are needed, and only two commodities (1 and 5) need transit. Furthermore, ring 
number VII passes nine nodes but is connected to six of them. Finally we note that the new solution 
of the LP-relaxation is still integral. 

Corresponding Author: M. Henningsson, Department of Mathematics 

Linkoping University, S-581 83 Linkoping, SWEDEN 
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Routenplanung fiir Gefahrguttransporte: 

Ein GIS-basierter Ansatz fur die Ermittlung kosten- und 
risikoeffizienter Routen 

Prof. Dr. Heinz Isermann, J. W. Goethe-Universitat, Frankfurt am Main 
Dipl.-Kfm. Michael Huth, J. W. Goethe-Universitat, Frankfurt am Main 

Summary: In diesem Beitrag werden Strukturen und Einsatzpotential eines Decision-Support- 

Systems (DSS) zur Routenplanung von Gefahrguttransporten vorgestellt. Die Applikation baut auf 
einem Geographischen Informationssystem (GIS) auf und nutzt dessen Starken im Datenmanage- 
ment fiir geographische Daten und in der Bereitstellung geographischer Analysefunktionen. Durch 
die Integration eines GIS gelingt es, geographische Daten (Verkehrsnetz, Ortschaften, Gew^ser, 
Wasser- und Naturschutzgebiete) und Sachdaten (Unfalldaten, Verkehrsleistungsdaten, demographi- 
sche Daten) zu verbinden und raumbezogene Schadenindikatoren fiir Streckenabschnitte zu ermitteln. 
Diese raumbezogenen Indikatoren lassen sich zur Risikobewertung von Transportrouten heranziehen. 
Das System ist in der Lage, auf der Basis der vom Entscheidungstrager verfolgten okonomischen und 
risikopolitischen Zielsetzungen funktional-effiziente Routen und auf der Basis der Risikopraferenzen 
des Entscheidungstragers Kompromifilosungen aus der Menge der funktional-effizienten Routen zu 
generieren. 

1. Einleitung 

Der Transport von Gefahrgiitern induziert ein Gefahrenpotential fiir Menschen, Tiere, Pflanzen, 
Umweltmedien, Sach- und Kulturgiiter. Sowohl fiir staatliche Institutionen als auch fiir Logistik- 
dienstleister, Chemie- und Mineralolunternehmen stellt sich die Aufgabe, das Risiko von Gefahrgut- 
transporten gering zu halten bzw. zu vermindern: Sowohl im Rahmen gesellschaftlicher als auch 
betriebswirtschaftlicher Entscheidungsprozesse kann ein Risikomanagement fiir Gefahrguttransporte 
einen nachhaltigen Beitrag zur Risikopravention bieten. Dariiber hinaus sind Transport versicherun- 
gen bestrebt, Risiken fiir Gefahrguttransporte ermitteln zu konnen, um diese Ergebnisse in die 
Pramienkalkulation einfliefien zu lassen. Somit besteht fiir eine EDV-Applikation zur Losung von 
Routenplanungsproblemen fiir Gefahrguttransporte ein breites Einsatzpotential. 

2. Gefahrgiiter und deren Schadenpotential 

Der Begriff Gefahrgut wird im deutschen Recht im Rahmen des Gesetzes iiber die Beforderung 
gefahrlicher Giiter (GBefGG) definiert; dort heifit es in §2, Absatz 1: ’’Gefahrliche Giiter [...] sind 
Stoffe und Gegenstande, von denen auf Grund ihrer Natur, ihrer Eigenschaften oder ihres Zustandes 
im Zusammenhang mit der Beforderung Gefahren fiir die offentliche Sicherheit oder Ordnung, ins- 
besondere fiir die Allgemeinheit, fiir wichtige Gemeingiiter, fiir Leben und Gesundheit von Men- 
schen sowie fiir Tiere und andere Sachen ausgehen konnen.” Bei einer Freisetzung von gefahrlichen 
Giitern aufgrund eines Storereignisses entstehen Gefahren fiir die Systemumwelt. Eine Gefahr ist die 
Moglichkeit der Entstehung eines Bedarfs, der wiederum aufgrund einer tatsachlichen Veranderung 
in dem Bestand oder Zustand von Objekten und Subjekten, bspw. Beschadigung einer Sache, Verlet- 
zung eines Korpers, Beeintrachtigung der Gesundheit oder sonstiger Rechtsgiiter, hervorgerufen wird 
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- es ist jedoch ungewifi, ob, wann und wieviel Bedarf entsteht [15]. In Verbindung mit der Freiset- 
zung von Gefahrgiitern aufgrund eines Storereignisses konnen Gefahren von den durch die Freisetzung 
ausgelosten Effekten, wie bspw. Luftdruckwellen, Bodenerschiitterungswellen, heifie Gase, Warme- 
oder radioaktive Strahlung, ausgehen. Um sowohl Gesetze und Verordnungen als auch in Notfall- 
situationen zu treffende Hilfsmafinahmen differenzierter auf das von den Gefahrgiitern ausgehende 
Gefahrenpotential abzustimmen, werden Gefahrgiiter gemafi dem von ihnen ausgehenden Gefahren- 
potential klassifiziert [8]. Die UN-Empfehlungen fiir den Transport von Gefahrgiitern unterscheiden 
9 Gefahrgutklassen, die teilweise noch Unterklassen enthalten, woraus sich insgesamt 20 Kategorien 
von Gefahrgiitern ergeben [20]. 

Die in §2, Absatz 1 GBefGG genannten Objekte und Subjekte sind Schutzobjekte, die vor Schaden 
im Sinne unfreiwilliger Einbufien an Rechtsgiitern durch freigesetzte Gefahrgiiter bewahrt werden 
sollen. Die verschiedenen Definitionen des Begriffs Schaden sind von der Wahrnehmung des Entschei- 
dungstragers abhangig und orientieren sich an den im Rahmen eines Risikomanagements verfolgten 
Zielen. Sie lassen einen weiten Spielraum, welche Effekte als negativ angesehen werden, d.h. der 
Kategorie Schadenursachen zugeordnet werden, und welche Schutzobjekte davon betroffen sind [14, 
hier sind quantitative Bewertungsansatze verschiedener Staaten aufgefiihrt, die insgesamt ein breites 
Spektrum an Schadendefinitionen abdecken]. Schaden werden grundsatzlich durch bestimmbare Ur- 
sachen hervorgerufen, so dafi nicht der Fall totaler Unsicherheit vorliegt. Ungewifiheit tritt insofern 
auf, als die Ursachen-Wirkungsbeziehungen nicht oder nicht im Detail bestimmbar sind. Im Rah- 
men risikoanalytischer Untersuchungen ist es notwendig, die durch die Freisetzung von Gefahrgiitern 
ausgelosten Effekte und ihre Wirkungen auf die zu beriicksichtigenden Schutzobjekte, die Schadenka- 
tegorien (z.B. Schaden hinsichtlich der menschlichen Gesundheit, okonomische Schaden, okologische 
Schaden, Schaden bei gesellschaftlichen Institutionen, Schaden hinsichtlich der Lebensqualitat) und 
die einer Schadenkategorie zuzuordnenden Schadenindikatoren (z.B. Anzahl der getoteten, schwerver- 
letzten, leichtverletzten Menschen) zu determinieren [2]. Das von uns konzipierte DSS erlaubt dem 
Entscheidungstrager, aus einer Menge gegebener Schadenindikatoren jene auszuwahlen, die fiir ihn 
konst itutive Indikatoren eines Schadens sind. 

3. Risikoanalyse fur Gefahrguttransporte 

Es hat nicht an Versuchen gefehlt, den Risikobegiff definitorisch festzulegen, aber aufgrund der 
subjektiven Wahrnehmung und den Zielsetzungen eines Entscheidungstragers sind Definitionen sub- 
jektiv und zweckbezogen und konnen somit keinen Anspruch auf Allgemeingiiltigkeit erheben. Der 
Risikobegriff wird immer im Zusammenhang mit unvollstandiger Information fiber die Wirkungszu- 
sammenhange der Realitat und den daraus resultierenden Ziel- und Planabweichungen, die moglicher- 
weise zu Schaden ffihren, gesehen. In der Gefahrgutlogistik kann der Begriff Risiko als Summe aller 
Moglichkeiten, dafi aufgrund von stochastischen Storereignissen negative Abweichungen zwischen 
dem a priori kalkulierten und a posteriori tatsachlich eintretenden Wert der vom Entscheidungstrager 
definierten Zielgrofie eintritt, aufgefafit werden [18]. Um nach der Ermittlung der unternehmerischen 
Ziele und Rahmenbedingungen eine Risikolage zu analysieren, d.h. Risikokomponenten identifizieren 
und bewerten zu konnen, sowie anschliefiend Mafinahmen zur Risikobewaltigung bzw. Risikopraven- 
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tion entwickeln, beurteilen, auswahlen, implementieren und uberpriifen zu konnen, bedarf es eines 
Risikomanagements. Unter Risikomanagement wird das systematische Anwenden von Methoden 
und Techniken verstanden, die das Unternehmen und dessen Ressourpen vor Verlusten und Schaden 
bewahren sollen [7]. Risikomanagement kann somit als Prozefi aufgefaBt werden [18]. Die Identi- 
fizierung von Risiken und deren Bewertung werden zusammenfassend als Risikoanalyse bezeichnet. 
Fiir die Risikoidentifikation erfolgt eine Abbildung des zu untersuchenden Systems und der relevan- 
ten Einflufifaktoren, die zu Storungen des Systems fiihren konnen; mit Hilfe von Methoden sollen alle 
Risikoparameter, d.h. objektbezogene, personenbezogene Einflufifaktoren, die sich auf die Hohe des 
Risikos Oder seiner Komponenten auswirken, erkannt, beschrieben und qualitativ bewertet werden. 
Im Rahmen der Risikobewertung erfolgt fiir die identiflzierten Risiken eine quantitative Bewertung. 
Hierbei sind alle Risikoparameter zu quantiflzieren und direkt oder in aggregierter Form als Risiko- 
kennzahlen darzustellen sowie der funktionale Zusammenhang der Risikokennzahlen zur Hohe des 
Risikos zu bestimmen. 

Abhangig von der Definition des Begriffs Risiko konnen verschiedene Risikomafie, die das Risiko unter 
verschiedenen Aspekten beschreiben, verwendet werden. Ein Risikomafi ordnet dem Eintritt eines 
bestimmten Schadenausmafies einen Wert zu. Entsprechend der betriebswirtschaftlichen Definition 
von Risiken, als Grad der Erfiillung der vom Entscheidungstrager definierten Ziele, dienen charak- 
teristische Kenngrofien der Wahrscheinlichkeitsverteilung des Schadens als Risikomafie. In diesem 
Zusammenhang werden Erwartungswert oder auch Streuungsmafie, wie Standardabweichung, Vari- 
anz bzw. Variationskoeffizient, haufig verwendet. In der Praxis, speziell in der Gefahrgutlogistik, 
wird der Erwartungswert der Schadenverteilung verwendet, der wie folgt formal dargestellt werden 
kann: 

R = E[g{X,k{x))] 

Hierbei beschreibt g eine Schadenfunktion in Abhangigkeit der Zufallsvariable A G die den 
Schaden beschreibt, und einem Kompensationsmafi k(x). In diesem Zusammenhang dient das Kom- 
pensationsmafi dazu, die Giite des verwendeten Risikomafies fiir Situationen, in denen die Warschein- 
lichkeit des Eintretens von unverhaltnismafiig grofien Schaden beriicksichtigt wird, zu bewahren. Da 
die hier aufgefiihrten Risikomafie in der Regel nur Teilaspekte des Risikos wiedergeben, konnen die 
hierdurch beschrankte Eignung und Aussagefahigkeit durch die Integration eines Kompensations- 
mafies ausgeglichen werden. Aufier diesen pragmatischen Ansatzen zur Risikoer mitt lung konnen 
auch theoretische Ansatze, die auf einem erwarteten negativen Nutzen bzw. Ruinwarscheinlichkeiten 
basieren, verwendet werden. Die formale Darstellung dieser theoretischen Ansatzen zur Risikobe- 
wertung kann wie folgt beschrieben werden: 

R = E[u{X,k{x))] 

R = P{—X > r) mit r > 0 

Analog zur durch ein Kompensationsmafi modifizierten Schadenfunktion wird in der ersten mathe- 
matischen Formulierung eine Nutzenfunktion u verwendet. Das zweite Risikomafi entspricht der 
Wahrscheinlichkeit, dafi der Absolutbetrag des Schadenausmafies grofier als eine bestimmte Grofie 
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r > 0 ist, die einen Ruin verursachen wiirde. Um im Rahmen der Risikobewertung die Schaden- 
verteilung bestimmen zu konnen, sind die Eintrittswahrscheinlichkeiten potentieller Schaden zu 
quantifizieren. Die Eintrittswahrscheinlichkeiten potentieller Schaden hangen wiederum von den 
aus der zugrundeliegenden Schadendefinition hervorgehenden konstitutiven Schadenindikatoren ab. 
Da in der Regel eine Folge von Storereignissen, dadurch ausgeloste Effekte und daraus resultierende 
Auswirkungen auf die beriicksichtigten Schutzobjekte zum Eintritt eines bestimmten Schadenaus- 
mafies fiihren, lafit sich die Eintrittswahrscheinlichkeit eines Schadens aus bedingten Wahrschein- 
lichkeiten ermitteln. 




Figure 1: Ereignisbaum fiir Gefahrgutunfalle 



Anhand des in Figure 1 dargestellten Ereignisbaumes wird die Berechnung von bedingten Wahrschein- 
lichkeiten erlautert: Die marginalen Eintrittswarscheinlichkeiten beriicksichtigter Ereignisse P{Ai) 
sind gegeben, wobei die Ereignisse Ai (mit i € I) z.B. einen Unfall, die Einwirkung auf bzw. 
das Offnen der Gefahrgutumschliefiung, den Austritt von Gefahrgut beschreiben. Bezugnehmend 
auf den Ereignisbaum lafit sich bspw. die Wahrscheinlichkeit des Ereignisses A3, das Offnen der 
Gefahrgutumschliefiung, als bedingte Wahrscheinlichkeit von der Schnittmenge der beiden Ereignisse, 
dafi ein Unfall eintritt (Ai) und dieser die Gefahrgutumschliefiung beeintrachtigt (A2), wie folgt be- 
stimmen: P(A3|A2n^i)- Durch diese qualitative und quantitative Abbildung von Risiken kommt 
der Risikoanalyse eine grundlegende Bedeutung fiir den gesamten Prozefi des Risikomanagements 
zu. Im Bereich der Gefahrguttransporte bilden entweder die gesamte logistische Prozefikette oder 
ausgewahlte logistische Prozesse (Transport-, Umschlag- oder Lagerprozesse) den Gegenstand der 
Risikoanalyse [18]. 

Der Ablauf der Risikoanalyse zur Ermittlung von Risikokennzahlen, die bei Planungsaufgaben im 
operativen Logistikmanagement Eingang finden, ist in unserem System wie folgt abgebildet [12, 
16](vgl. dazu Figure 2): 
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Figure 2: Phasen der Risikoanalyse 

1. Unfallanalyse: Im ersten Schritt der Risikoanalyse wird fiir jeden Streckenabschnitt die Ein- 
trittswahrscheinlichkeit eines Unfalls ermittelt. Die Datenbasis dazu besteht aus streckenab- 
schnittsbezogenen Daten hinsichtlich der Anzahl der Unfalle und des Verkehrsaufkommens [16]. 
Diese Daten werden als Attribute des Verkehrswegenetzwerkes gespeichert; sie stehen damit 
auch fiir andere thematische Auswertungen durch Geographische oder andere Informationssy- 
steme zur Verfiigung. 

2. Eflfektanalyse: Im zweiten Schritt werden die (bedingten) Eintrittswahrscheinlichkeiten fiir 
die Freisetzung einer bestimmten Menge eines transportierten Gefahrgutes ermittelt. Hier- 
bei werden Ereignisbaume eingesetzt, in denen die Ergebnisse empirischer Untersuchungen 
beziiglich der Effekte, die durch einen Unfall ausgelost werden konnen, enthalten sind [10] (vgl. 
dazu Figure 3). Effekte sind bspw. die Reaktionsart der Gefahrgutumschliefiung, die Art des 
Gefahrgutaustritts oder die Gefahrgutreaktion. Faktoren, die diese Effekte beeinflussen konnen, 
sind z.B. Art und Menge des transportierten Gefahrgutes, Art der Gefahrgutumschliefiung. Die 
Effekte sind systematisiert und moglichst vollstandig abzubilden. In spateren Ausbaustufen des 
Systems ist es auch denkbar, Funktionen oder Simulationsmodule anstelle der Ereignisbaume 
einzusetzen. 

3. Schadeiianalyse: Abschliefiend werden das durch die Gefahrgutfreisetzung verursachte Schaden- 
ausmafi bzw. die beriicksichtigten Schadenindikatoren ermittelt. Relevant e Faktoren im Rah- 
men dieser Analyse sind bspw. Art und Menge des freigesetzten Gefahrgutes, meteorologische 
Zustande (Windstarke und -richtung, Niederschlag) sowie fiir die relevanten Schutzobjekte 
kritische Werte (Grenzwert eines Schutzobjektes hinsichtlich einer Einwirkungsart wie bspw. 
thermische oder mechanische Einwirkung), deren Uberschreitung zu einem Schaden fiihrt. 



Insbesondere in der Unfall- und in der Schadenanalyse ist es notwendig, empirische Daten und/oder 
natur- und ingenieurwissenschaftliche Funktionen raumbezogen zu verarbeiten. Fiir die Unfallanalyse 
werden die Daten des Verkehrsnetzes vorgehalten und den Attributen dieser Netzdaten (wie z.B. An- 
zahl der Unfalle sowie Fahrleistungsdaten eines Streckenabschnitts) Werte zugewiesen. Im Rahmen 
der Schadenanalyse sind die durch die Schadenindikatoren definierten Schutzobjekte raumbezogen 







448 



-0.010 Rre only 

- 0.008 Rre and oollision 

- 0.683 Collision 



- 0.21 9 Overturn only 

- 0.039 Spontaneous leak 

- 0.041 Vehicle trivial failure 



0.265 Rxed obstacle 
0.073 Rxed and moving obstacle 
' — 0.662 moving obstacle 

-0.250 With puncture 
- 0.750 Without puncture 



Figure 3: Ereignisbaum und empirische ermittelte Eintrittswahrscheinlichkeiten fiir EfFekte eines 
Gefahrgutunfalls (Quelle: [10]) 

zu erfassen und das Schadenausmafi pro Schutzobjekt zu quant ifizieren. Diese Aufgabenstellungen 
bedingen den Einsatz eines Geographischen Informationssy stems. 

4. Geographisches Informationssystem (GIS) 

Ein Geographisches Informationssystem (GIS) ist ein computergestiitztes System, das raumbezogene 
und nicht- raumbezogene Daten vorhalt, verarbeitet und aus diesen Daten entscheidungsrelevante 
Informationen generiert [4, 13]. Die Bereitstellung von geographischen Funktionen und das Manage- 
ment von geographischen und nicht-geographischen Daten sind Starken, die in unserem DSS in den 
folgenden Bereichen zum Tragen kommen [17, 3]: 

• Logistische Planungsaufgaben setzen haufig auf Transport- oder Prozefinetzwerke auf; in un- 
serem EDV-System sind das Strafienverkehrsnetz, das Schienennetz sowie das Netz der Bin- 
nenwasserstrafien integriert, um auch verkehrstrageriibergreifende Analysen zu unterstiitzen. 
Fiir die Applikation werden die Knoten und die Kanten dieser Netzwerke als geographische 
Objekte vorgehalten. Gleichzeitig werden Umschlag- und Lagerprozesse als intra- und inter- 
verkehrstragerbezogene Kanten dargestellt, so dafi die logistischen Kernaufgaben Transport, 
Lager ung und Umschlag durch ein Prozefinetzwerk dargestellt werden konnen. 

• Im Rahmen der Schadenanalyse wird das potentielle Schadenausmafi hinsichtlich der Schutz- 
objekte ermittelt. Die Frage nach Art und Umfang eines Schadens der Schutzobjekte kann 
nur durch die Anwendung von geographischen Funktionen beantwortet werden (Ermittlung 
der Schutzobjekte im Wirkungsbereich des ausgetretenen Gefahrgutes, Ermittlung von Art 
und Umfang der Schadigung). Daher ist es notwendig, auch die Schutzobjekte als geographi- 
sche Objekte im GIS zu integrieren. Zu den Schutzobjekten zahlen bspw. Ortschaften oder 
Wasserschutzgebiete, je nach Detaillierungsgrad der Untersuchungen auch Kulturgiiter und 
Er holungsst atten . 

• Fiir die genannten geographischen Objekte (Knoten und Kanten der Verkehrsnetzwerke, Schutz- 
objekte) werden Attribute und deren Auspragungen vorgehalten, die fiir die Analyse logistischer 
oder risikoanalytischer Aufgaben relevant sind. Im Bereich der Verkehrsnetze sind als Attribute 
bspw. die Strafienklassen, Angaben zur Betriebskilometrierung, die Anzahl von Gleisen oder 




449 



die Breite von Fliissen zu integrieren. Auspragung des Attributes ” Strafienklasse” konnen 
Autobahn, Bundesstrafie, Landesstrafie, Kreisstrafie oder Gemeindestrafie sein. Fiir Schutz- 
objekte werden z.B. die Eigenschaften Art und Bezeichnung der Schutzobjekte, Grofie bzw. 
flachenmafiige Ausdehnung oder - bei Ortschaften - demographische Informationen, wie die 
Anzahl der Einwohner oder die Bevolkerungsdichte, vorgehalten. 

• Die in unserem DSS vorgehaltenen Datenbestande stammen aus unterschiedlichen Quellen und 
besitzen verschiedene Daten- und geographischen Formate. Durch die Bereitstellung geeigneter 
Funktionen ermoglichen GIS die Geokodierung dieser Daten, also die Herstellung eines ein- 
heitlichen Raumbezugs fiir die zu integrierenden Daten [9]. Die Geokodierung wird sowohl fiir 
geographische Daten und Schutzobjekte durchgefiihrt als auch fiir deren Attribute. 

• Geographische Funktionen werden auch in einzelnen Phasen der risikoorientierten Routenpla- 
nung benotigt [6, hier werden die Phasen, in denen GIS im Rahmen von Routenplanungen fiir 
Gefahrguttransporte zum Einsatz kommen, beschrieben]. GIS halten Funktionen vor, die bspw. 
die Entfernung zwischen zwei Punkten oder die Flache innerhalb eines Polyeders berechnen. 
Im Rahmen der Schadenanalyse kommt die Funktionalitat von GIS insbesondere dadurch zum 
Tragen, als sich jene Schutzobjekte ermitteln lassen, die im Wirkungsbereich eines freigesetz- 
ten Gefahrgutes liegen. Weiterhin lafit sich das Schadenausmafi fiir die im vorherigen Schritt 
selektierten Schutzobjekte ermitteln. 

• Ein GIS bietet dem Anwender visuelle Unterstiitzung: Bei der Parametereingabe und -variation 
werden Graphiken von digitalen Karten zur Eingabe verwendet. Wir nutzen dies, um den 
Anwender z.B. Quelle, Senke und mogliche Zwischenziele einer Route auswahlen zu lassen. In 
der Phase der Ergebnisausgabe werden die Resultate der Analysen und die ermittelten Routen 
(oder die optimale Route) innerhalb der Karte angezeigt. 




Figure 4: Aufbau des Decision-Support-Systems (in Anlehnung an [19] und [13]) 
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Unsere EDV-Applikation umfafit - neben einem Geographischen Informationssystem - eine umfangrei- 
che Modell- und Methodenbank zur Durchfiihrung von Risikoanalysen sowie fiir die Routenplanung. 
Dariiber hinaus werden Daten zu Gefahrgiitern vorgehalten (Name des Gefahrgutes, UN-Nummer, 
Gefahrgutklasse, besondere Vorschriften hinsichtlich Verpackung u. a.). Weiterhin ist ein Modul 
implementiert, welches Expertenwissen zu bestimmten Aufgabenstellungen abbildet; hier werden In- 
formationen zu Notfallmafinahmen (wie z.B. die im North American Emergency Response Guidebook 
beschriebenen Mafinahmen) sowie Vorschriften fiir die Verkehrstrager- und die Streckenwahl (die fiir 
den Strafienverkehr in der Gefahrgutverordnung Strafie GGVS aufgefiihrt sind) vorgehalten. Der 
Aufbau unseres Decision-Support-Systems wird in Figure 4 wiedergegeben. 

5. Generierung kosten- und risikoefflzienter Routen 

Die potentiellen Nutzer von computergestiitzten Risikoanalysen fiir Gefahrguttransporte verfolgen, 
wenn es um die Bestimmung einer Transportroute geht, unterschiedliche Zielsetzungen. Nicht nur 
dkonomische Ziele, wie z.B. Minimierung der Gesamtkosten, der Routenlange oder der Fahrtdauer, 
auch risikopolitische Ziele, wie bspw. die Minimierung des Gesamtrisikos einer Route, die Minimie- 
rung der Unfall wahrscheinlichkeit auf einer Route oder die Minimierung der Summe der potentiell 
durch freigesetztes Gefahrgut betroffenen Personen, sollten aufgrund der gesellschaft lichen Verant- 
wortung der Entscheidungstrager auf die Routenwahl Einflufi nehmen [1]. Wie sich unterschiedliche 
risikopolitische Ziele auf die Routenbildung auswirken, illustriert Figure 5 fiir einen Gefahrguttrans- 
port von Frankfurt am Main nach Ludwigghafen. 




Figure 5: Routenbildung bei unterschiedlichen risikopolitischen Zielen 



Soweit der Entscheidungstrager bei der Routenplanung gleichzeitig mehrere dkonomische oder risi- 
kopolitische Ziele verfolgt, ist ein Routenplanungsproblem bei mehrfacher Zielsetzung zu losen. Es 
sei V die Menge der fiir einen Gefahrguttransport von der Quelle q zur Senke s zulassigen Routen 
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und Zk{y) die vom Entscheidungstrager verfolgte k-te zu minimierende Zielfunktion {k = 

Dann lafit sich das Routenplanungsproblem bei mehrfacher Zielsetzung als Vektorminimumproblem 

zi{y) 

"min”{z(y)= ] 1 3/ e F} 

. 2K(y) . 

formulieren. Eine Route y* e Y ist genau dann funktional-effiziente Losung dieses Vektormini- 
mumproblems, wenn keine Route y' €Y exist iert mit der Eigenschaft 

Zk(y') < Zk(y') fiir A: = 1 , K 

und 

Zk{y') < Zk{y*) fur mindestens ein k e {l,...,Rr}. 

Die Auswahl einer funktional-effizienten Kompromifiroute, bei der aus der Sicht des Entscheidungstra- 
gers sowohl den okonomischen als auch den risikopolitischen Belangen in ausgewogener Weise Rech- 
nung getragen wird, kann auf drei Arten erfolgen [11]: 

1. Der Entscheidungstrager spezifiziert eine Kompromifizielfunktion, die sein Zielsystem repr^en- 
tiert. Die beziiglich dieser Kompromifizielfunktion optimale Route ist dann die gesuchte Kom- 
promifiroute. 

2. Dem Entscheidungstrager werden die endlich vielen funktional-effizienten Routen vorgelegt. Er 
wahlt auf der Basis seiner okonomischen und risikopolitischen Zielvorstellungen personlich eine 
der funktional-effizienten Routen als Kompromifilosung aus. 

3. Der Entscheidungstrager wahlt im Rahmen eines interaktiven Entscheidungsprozesses eine 
funktional-effiziente Kompromifilosung aus. 

Mit dem hier vorgestellten GIS-basierten System lafit sich zum einen eine beziiglich einer zu mini- 
mierenden skalaren okonomischen oder risikopolistischen Zielfunktion oder Kompromifizielfunktion 
optimale Route ermitteln. Dariiber hinaus sieht das Decision-Support-System die Moglichkeit vor, 
dafi der Entscheidungstrager im Rahmen eines interaktiven Entscheidungsprozesses auf der Grund- 
lage des modifizierten STEM-Verfahrens [11] eine funktional-effiziente Kompromifiroute ermittelt. 
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Das Groupage-Problem kooperierender Verkehrstrager 



Prof. Dr. Herbert Kopfer, Universitat Bremen 
Dipl.-Kfm. Giselher Pankratz, FernUniversitat Hagen 

Zusammenfassung: Infrastrukturelle Engp^se und verschMter Wettbewerb im Transportsek- 

tor zwingen die Transportunternehmen zur Effizienzsteigerung bei der Erstellung von Verkehrslei- 
stungen. Nicht allein eine Optimierung der individuellen Auftragsdisposition, sondern erst ein un- 
ternehmensiibergreifender Abgleich zwischen den verfiigbaren und benotigten Kapazitaten kooperie- 
render Transportunternehmen (Groupage) lafit eine deutliche Auslastungssteigerung der Produk- 
tionsfaktoren erwarten. Der Beitrag stellt das Groupage-Problem vor, das darin besteht, die sukzes- 
sive und unregelmafiig eintreffenden Transportauftrage moglichst giinstig den freien Kapazitaten der 
Kooperationsteilnehmer zuzuordnen. Als einzelwirtschaftlicher Ausgangspunkt wird das speditionelle 
Dispositionsproblem identifiziert und auf Varianten des Pickup-and-Delivery-Problems (PDP) und 
des Frachtoptimierungsproblems (FOP) zuriickgefiihrt. Bei Erweiterung dieser isolierten Betrach- 
tung um die Beriicksichtigung zwischenbetrieblicher Kooperation ergibt sich das Groupage-Problem 
als offenes, kombiniertes Zuordnungs-, Reihenfolge-, Flufi- und Schedulingproblem. Eine Diskussion 
dezentraler Ansatze zur Losung verwandter Problemstellungen schliefit den Beitrag ab. 

1 . Transportwirtschaftliche Ausgangssituation 

Die Transportwirtschaft befindet sich seit einigen Jahren im Umbruch. Im okonomischen Um- 
feld der Giiterverkehrsunternehmen fiihrte insbesondere das Zusammenwirken von Logistik- und 
GiiterstrukturefFekt zu einem Anstieg des gesamten Verkehrsaufkommens und gleichzeitig zu erhohten 
qualitativen Anforderungen an die Giiterverkehrsleistungen. Aufgrund seiner Systemvorteile im Hin- 
blick auf die neuen Anforderungen ist ein Grofiteil des erhohten Aufkommens vom StraBengiiter- 
verkehr zu tragen. Allerdings hat die Belastung des Strafienverkehrsnetzes mittlerweile die Ka- 
pazitatsgrenze der bestehenden Infrastruktur erreicht. Da mit einem Ausbau des Verkehrsnetzes zur 
Kapazitatserweiterung (Expansionsstrategie) aus okologie- und finanzpolitischen Griinden derzeit 
nicht zu rechnen ist, erscheint allein eine Besserauslastung der vorhandenen Kapazitaten erfolgver- 
sprechend (Intensivierungsstrategie). Der Zwang zu einer solchen Effizienzsteigerung bei der Er- 
stellung von Verkehrsdienstleist ungen, insbesondere im Strafiengiiterverkehr, wird verstarkt durch 
die Liber alisierung der Verkehrsmarkte im Zuge der Europmschen Integration, die eine Zunahme 
der Wettbewerbsintensitat auf den nationalen Verkehrsmarkten zur Folge hat (Deregulierungsef- 
fekt). Hier ist insbesondere die Aufhebung der Preisregulierung durch Wegfall der Tarifbindung zum 
1.1.1994 sowie die Freigabe der Kabotage Mitte 1998 zu nennen. 

2. Grundlagen der speditionellen Transportdisposition 

Zur Losung der aufierbetrieblichen Transportaufgaben, die eine Spedition fiir ihre Verladerkunden 
wahrnimmt, sind durch den Aufbau geeigneter Transportketten fiir jeden eintreffenden Transport auf- 
trag das giinstigste Transportmittel und der giinstigste Transportprozefi zu bestimmen [8]. Das Prob- 
lem der Transportkettenplanung wird im folgenden als das speditionelle Dispositionsproblem bezeich- 
net, wobei sich die vorliegende Darstellung auf eingliedrige, unimodale Transportketten konzentriert, 
bei denen die gesamte Transportkette ohne Transportmittel- Wechsel abgewickelt wird (ungebroche- 
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a) Auftragslage mit vier 
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b) Beispiel fur eine Routenpla- 
nung bei Durchfuhrung ailer 
Auftrage im Selbsteintritt 



c) Beispiel fur eine Sendungs- 
gestaitung bei Fremdver- 
gabe ailer Auftrage 



Abb. 1: Fahrzeugdisposition bei Selbsteintritt bzw. Fremdvergabe 

ner Verkehr) und als Transportmittel ausschliefilich Fahrzeuge des Strafiengiiterverkehrs zum Einsatz 
kommen. Die Ausgangsobjekte des speditionellen Dispositionsproblems sind 

• die Transportauftrdge^ die sukzessive und unregelmafiig eintreffen und durch Be- und Ent- 
ladeort, entsprechende Be- und Entladezeitfenster sowie Art und Kapazitatsbedarf der zu 
befordernden Giiter charakterisiert sind; 

• eine Anzahl betriebseigener Fahrzeuge fiir den Selbsteintritt, im wesentlichen gekennzeichnet 
durch die fahrzeugspezifische Kapazitat (Gewicht und Volumen) sowie eine fahrzeugspezifische 
Kostenfunktion zur Bestimmung der variablen Selbstkosten; sowie 

• eine Anzahl selbstandiger Frachtfiihrer fiir die Fremdvergabe, deren spezifische Tariffunktion 
zur Berechnung der sendungsbezogenen Transportent gelte dient. 

Im Rahmen der Transportmittelwahl hat eine Kraftwagen-Spedition zu entscheiden, ob ein Auftrag 
nach §412 HGB im Selbsteintritt, also mit speditionseigenen Fahrzeugen, oder als fremdvergebener 
Auftrag durch einen externen Frachtfiihrer nach §425 HGB ausgefiihrt werden soil (Make or Buy). 
Im Selbsteintritt durchgefiihrte Auftrage werden den Touren einzelner Fahrzeuge zugeordnet. Fiir 
jede Tour eines Fahrzeugs ist eine Route (Rundreise) zu bestimmen (Transport prozefi. Abb. lb). 
Die Planungssit nation im Selbsteintritt lafit sich als erweitertes dynamisches Pick- Up- and- Delivery- 
Problem mit Zeitfensterrestriktionen (DPDPTW) beschreiben [12]. Fiir jedes eigene Fahrzeug gibt 
eine fahrzeugspezifische Kostenfunktion Auskunft fiber die Hohe der variablen Selbstkosten in Ab- 
hangigkeit von der zurfickgelegten Strecke und der Einsatzdauer einer Tour. Die Bewertung ei- 
nes Tourenplans ergibt sich durch die Summe der variablen Selbstkosten ailer Touren der eigenen 
Fahrzeuge. 

Fremdvergebene Auftrage werden einzelnen Frachtffihrern zugewiesen. Dadurch geht mit der Trans- 
portprozefiplanung auch die Kostenverantwortung an den Frachtfiihrer fiber. Entscheidungsrele- 
vant ist aus Sicht der Spedition das an den Frachtfiihrer zu zahlende Beforderungsentgelt, die soge- 
nannte Fracht, die sich nach dem frachtffihrerspezifischen Tarif berechnet. Typisch fiir Transport- 
tarife ist der degressiv ansteigende Verlauf des Frachtbetrags bezfiglich der entgeltbestimmenden 
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GroBen (z.B. Transportentfernung und Transportgewicht) aufgrund einer Fixkostendegression. Durch 
eine geschickte Sendungsgestaltung, d.h. durch eine geeignete Aggregation der einzelnen Trans- 
port auftrage zu neuen Sendungen, lassen sich unter Ausnutzung der Tarifdegression haufig erheb- 
liche Ersparnisse gegeniiber einer Einzelabrechnung derselben Auftrage erzielen (Abb. Ic). Der 
tatsachliche Ablauf des physischen Transports ist nicht von der Sendungsgestaltung abhangig. Die 
zu minimierenden Gesamtkosten einer Sendungsgestaltung ergeben sich aus der Summe der Trans- 
portkosten aller Sendungen, aus denen sich die Sendungsgestaltung zusammensetzt. Das Problem 
der frachtoptimalen Sendungsgestaltung stellt ein Flufiproblem dar und unterscheidet sich damit 
strukturell von der Planungssit nation bei Selbsteintritt, in der ein kombiniertes Zuordnungs- und 
Reihenfolgeproblem zu losen ist. In der Liter atur des Operations Research ist dieses Problem als 
statisches Problem fiir den Ein-Fahrzeug-Fall formuliert worden und wird dort als das Frachtopti- 
mierungsproblem (FOP) bezeichnet [6]. Unter den Gegebenheiten der Groupage bedarf das Fracht- 
optimierungsproblem noch einiger Erweiter ungen, die sich insbesondere auf die Dynamik der Prob- 
lemstellung und die Beriicksichtigung mehrerer Fahrzeuge unterschiedlicher Frachtfiihrer beziehen. 
Mufi im Fall einzelner Frachtfiihrer die Durchfiihrbarkeit einer Sendungsgestaltung iiberpriift werden 
(z.B. auf die Einhaltung von Zeit- und Kapazitatsrestriktionen), wird gleichzeitig eine abschatzende 
Routenplanung erforderlich, die aber fiir die Kostenermittlung unerheblich ist. 

Aufgrund ihrer Interdependenz sind die genannten Teilprobleme des speditionellen Dispositionsprob- 
lems simultan zu betrachten (Abb. 2). Da zu keinem Zeitpunkt der Planung alle Informationen vor- 
liegen, die zur Ermittlung eines ex-post-Optimums erforderlich waren, spricht man von einer offenen 
Problemstellung. Die Bewertung eines Zustandes in der Gesamtplanung erfolgt anhand der Summe 
aus den im Selbsteintritt entstandenen variablen Selbstkosten einerseits und den bei Fremdvergabe zu 
zahlenden Transportentgelten andererseits. Unter der Voraussetzung, dafi jeder Auftrag auszufiihren 
ist und die Planung keinen Einflufi auf die Hohe der Transporterlose hat, kann das Ziel der Dispo- 
sition grundsatzlich darin gesehen werden, diese Kostensumme zu minimieren. Bei einer dynami- 
schen Problemstellung wie der vorliegenden bereitet das Kriterium der Transportkostenminimierung 
Schwierigkeiten, weil kein abgeschlossener Planungszustand ermittelt und somit auch dessen Kosten 
nicht angegeben werden konnen [9]. Um unter diesen Voraussetzungen verschiede Planungsvarianten 
bewerten zu konnen, eignet sich das Marginalkostenprinzip, dem zufolge ein eintreflFender Auftrag 
dort in den Plan einzufiigen ist, wo er gegeniiber der Situation ohne diesen Auftrag die geringsten 
zusatzlichen Kosten verursacht. Die Einplanung eines Auftrages nach diesem Kriterium behalt bis zur 
endgiiltigen Auftragsausfiihrung vorlaufigen Charakter; gegebenenfalls ist sie im Verlauf der Suche 
mehrfach der geanderten Auftragssituation anzupassen. 

Die meisten Systeme zur Unterstiitzung der Transportdisposition zielen auf eine verbesserte Touren- 
planung ab und beschranken sich somit auf den Fall des Selbsteintritts, ohne die Make-Or-Buy- 
Entscheidung zu beriicksichtigen. Aufgrund des fortgesetzten Trends zum Outsourcing des physi- 
schen Transports an Frachtfiihrer-Unternehmen gewinnt die Fremdvergabe jedoch zunehmend an 
Bedeutung. Es ist zu erwarten, dafi die Entwicklung von Verfahren zur Unterstiitzung bei der 
Losung des speditionellen Dispositionsproblems zu einer effizienteren Transportabwicklung in den 
Speditionsunternehmen fiihren wird. Allerdings ist das durch eine blofie Verbesserung des Pla- 
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Abb. 2: Das speditionelle Dispositionsproblem (schematisch) 



nungsvorgangs nutzbare Rationalisierungspotential begrenzt. Die mangelnde Transparenz der Trans- 
port markte verhindert eine effiziente Allokation der Ressourcen und Auftrage. Zwar ist eine zwischen- 
betriebliche Abstimmung im Rahmen normaler Geschaftsbeziehungen moglich, um freie Kapazitaten 
an Unternehmen mit komplementarem Kapazitatsbedarf zu vermarkten, doch erschweren die hohen 
Transaktionskosten solcher bilateraler Einzelkontrakte den Such- und Abstimmungsprozefi deutlich. 
Um diesem Allokationsdefizit zu begegnen, ist die Errichtung kooperativer Systeme unumganglich, 
in denen mehrere Speditionen ein auf Dauer angelegtes interorganisatorisches Beziehungsgefiige auf- 
bauen, um iiber einen Ausgleich ihrer Kapazitaten eine Effizienzsteigerung zu erreichen. 



3. Das Groupage-Problem 

3.1 Grundgedanke und Vorteilhaftigkeit der Groupage 

Eine speditionsiibergreifende Kooperation bietet Moglichkeiten zu einer deutlichen Steigerung der 
Auslastung der Produktionsfaktoren im Transportsektor. Der hierzu erforderliche unternehmens- 
iibergreifende Abgleich zwischen verfiigbaren und benotigten Transportkapazitaten wird als Groupage 
bezeichnet. Unter Kooperation wird dabei die freiwillige, iiber normale Geschaftsbeziehungen hin- 
ausgehende Zusammenarbeit rechtlich und wirtschaftlich selbstandiger Unternehmen verstanden, die 
mit der Kooperation eine hohere Zielerreichung als bei individuellem Vorgehen anstreben [8, 10]. 
Logistische Interorganisationssysteme [1], die neben Funktionen fiir den geordneten Informations- 
austausch zwischen den kooperierenden Unternehmen auch Mechanisipen fiir einen automatisierten 
Kapazitatsausgleich enthalten, werden als Groupage-Systeme bezeichnet [7]. Besonders fiir kleine und 
mittlere Speditionen ist eine Groupage-Kooperation interessant, um sich im regionalen Kostenwett- 
bewerb gegen Grofispeditionen zu behaupten. Fiir die einzelnen Kooperationsteilnehmer beruht die 
Vorteilhaftigkeit der Groupage vor allem auf einer Senkung der durchschnittlichen auftragsbezoge- 
nen Stiickkosten aufgrund der verbesserten Auslastung der eingesetzten Transport mittel. Zum einen 
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ist hierfiir die Vergrofierung der Dispositionsmasse verantwortlich, die sich aus der abgestimmten 
Vorgehensweise bei der Auftrags- und Ressourcendisposition der Speditionen ergibt (economies of 
scale). Zum anderen tragen Verbundvorteile (economies of scope) zur verbesserten Auslastung bei, 
wenn einzelne Speditionen ihr spezielles Know-How fiir besondere Transportaufgaben den anderen 
Kooperationsteilnehmern zur Verfiigung stellen, so dafi deren Transportmittel ebenfalls fiir diese Bon- 
der aufgaben eingesetzt werden konnen. Neben den genannten Produktionskosteneffekten entstehen 
auch positive Transaktionskosteneffekte, da die Anbahnungs- und Vereinbarungskosten fiir die gegen- 
seitige Inanspruchnahme von Transportkapazitaten erheblich geringer ausfallen als ohne Kooperation. 
Dariiberhinaus besteht die Moglichkeit, die in der Kooperation vollzogene Ressourcenbiindelung zur 
verbesserten Gewahrleistung der Kundenanforderungen zu nutzen. 

3.2 Problemstellung und Modellierungsansatze 

Gegeben sei eine Menge kooperierender Speditionen. Jede Spedition verfiigt iiber eine bestimmte 
Anzahl von Transportmitteln mit beschrankter Transportkapazitat. Als Transportmittel stehen 
jeder Spedition sowohl Fahrzeuge eines eigenen Fuhrparks als auch Fahrzeuge von Frachtfiihrern 
zur Verfiigung. Bei jeder Spedition treffen kontinuierlich Transportauftrage von Verladerkunden ein. 
Das Groupage- Problem besteht darin, eine moglichst giinstige Zuordnung aller eintreffenden Trans- 
portauftrage zu den freien Kapazitaten der Kooperationsteilnehmer vorzunehmen. Das Groupage- 
Problem kann als Erweiterung des speditionellen Dispositionsproblems um die Beriicksichtigung zwi- 
schenbetrieblicher Zusammenarbeit aufgefafit werden. 

Denkbar ware, das erweiterte Problem als Simultanplanungsproblem zu modellieren, bei dem die 
Transportkapazitaten der Kooperationsteilnehmer zu einem zentralen Ressourcenpool zusammen- 
gelegt werden, dem die Gesamtheit aller einzuplanenden Auftrage gegeniibergestellt wird. Diese Mo- 
dellierung korrespondiert unmittelbar mit der hierarchischen Extremform der Koordination und be- 
deutet im Grunde genommen eine Quasi-Fusion der Kooperationsteilnehmer zu einer Super-Spedition. 
Dieser zentrale Modellierungsansatz geht an der Kooperationswirklichkeit vorbei, weil er wichtige As- 
pekte der wirtschaftlichen Selbstandigkeit der Teilnehmerspeditionen aufier acht laBt: 

• Der zentrale Ansatz setzt voraus, dafi die disponierende Instanz iiber alle Informationen verfiigt, 
die zur Bewertung der Planungssituation erforderlich sind. Auf das zeitkritische Groupage- 
Problem angewendet bedeutet dies, zu jedem Zeitpunkt vollkommene Kenntnis iiber den ak- 
tuellen Bearbeitungsstatus aller Auftrage, den aktuellen Standort und die Auslastung aller 
Fahrzeuge, sowie einen aktuellen Gesamtiiberblick iiber die momentane Kosten- und Erlossitu- 
ation aller verbundenen Speditionen zu besitzen. Eine solche liickenlose Aufdeckung betriebs- 
interner Informationen widerspricht jedoch dem Informationsschutzinteresse der Speditionen. 

• Die Speditionen handeln in erster Linie im eigenen Interesse und versuchen, durch die Teil- 
nahme an der Groupage-Kooperation die eigene Zielerreichung zu verbessern. Eine Groupage- 
Transaktion, z.B. ein Austausch von Auftragen zwischen Kooperationsteilnehmern, findet nur 
statt, wenn daraus fiir alle an der Transaktion beteiligten Unternehmen ein individueller 
Nutzenzuwachs resultiert. Die Durchsetzung eines globalen Kostenoptimums durch eine zen- 
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trale Instanz widerspricht der individuellen Rationalitdt, da nicht ausgeschlossen werden kann, 
dafi einzelne Teilnehmer Einbufien zugunsten des Gesamtoptimums hinnehmen mufiten. 

• Fiir die Speditionen bedeutet der zentrale Ansatz die vollstandige OfFenlegung ihrer eigenen, 
muhsam aufgebauten Kundenbeziehungen. Aufgrund der ausgepragten Kundenschutzinter- 
essen der Speditionen ist ein solches Vorgehen inakzeptabel. 

Dezentrale Modellierungsansatze ermoglichen eine realistischere Abbildung des Groupage-Problems. 
Ein naheliegender dezentraler Planungsansatz, der die inharente Verteilung der Problemstellung 
nutzt, ergibt sich durch die Dekomposition des Groupage-Problems in zwei Ebenen: Auf der Spedi- 
tionsebene stellt sich fiir jede Teilnehmerspedition ein kombinatorisches Optimierungsproblem in 
Form des speditionellen Dispositionsproblems. Auf der iibergeordneten Ebene ist ein iiberbetrieb- 
liches Allokationsproblem fiir die Zuweisung der Auftrage zu losen. Die Speditionen pflegen ihre 
eigenen Kundenkontakte, akquirieren selbstandig Auftrage und nehmen die Bewertung einzelner 
Planungsvarianten dezentral auf der Basis lokalen Planungswissens vor. Es findet kein direkter 
Durchgriff auf die lokalen Kapazitaten durch eine ’’allwissende” Instanz statt. Somit ist die Au- 
tonomie der Kooperationsteilnehmer in bezug auf Informations- und Kundenschutzinteressen weit- 
gehend gewahrt. Ein weiterer Vorteil des dezentralen Ansatzes besteht in der Komplexitatsreduktion, 
die mit der Dekomposition der Planungsaufgabe einhergeht. Zur informationstechnischen Umsetzung 
dezentraler Planungsansatze werden seit einigen Jahren verstarkt Multi-Agenten-Systeme diskutiert. 
Obwohl sich bislang keine einheitliche Definition fiir den Agentenbegriff herauskristallisiert hat [4], 
besteht ein weitgehender Konsens fiber wesentliche Eigenschaften eines Agenten. Nach [15] ist ein 
Agent ein Softwaresystem, das autonom, d.h. ohne direkte externe Intervention operiert, mit anderen 
Agenten kommuniziert und interagiert, auf Veranderungen seiner definierten Umgebung reagiert, 
sowie zielgerichtet und eigeninitiativ (proaktiv) handelt. Arbeiten mehrere Agenten an der Losung 
eines Problems, so spricht man von einem Multi- Agenten- System (MAS). Aufgrund dieser Merkmale 
bietet es sich an, das Szenario der Groupage-Kooperation als MAS zu modellieren. 

4. Multi-Agenten-Systeme fur das Groupage-Problem 

4.1 Multi-Agenten-Systeme fiir verwandte Problemstellungen 

In der Literatur finden sich mehrere Ansatze zur Fahrzeugeinsatzplanung mittels Multi-Agenten- 
Systemen. Der folgenden Diskussion liegen ffinf dieser MAS zugrunde. Es folgt eine kurze, klassi- 
fizierende Charakterisierung der Systeme anhand wesentlicher Merkmale. 

Den Agentensystemen liegen jeweils unterschiedliche Transportszenarien zugrunde: Falk [2] und 
Gomber et AL. [5] gehen von einer Grofispedition aus, deren regionale Niederlassungen durch 
Koordination der lokalen Planungsaktivitaten eine verbesserte Planung auf Konzernebene erzielen 
wollen. Fischer et al. [3] und Sandholm [11] dagegen betrachten die Kooperation selbstandiger 
(Transport-)Unternehmen, die ausschliefilich ihre eigenen Interessen verfolgen und von einer ko- 
operativen Transportplanung individuell profitieren. Utecht [14] legt seinem Agentensystem ein 
Multi-Depot-Tourenplanungsproblem zugrunde. Obwohl es sich dabei um eine statische Problem- 
stellung in einem einzelwirtschaftlichen Szenario handelt, soil dieser Ansatz wegen seiner prinzipiell 
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iibertragbaren Dekompositionsidee hier Beriicksichtigung finden. 

Je nach Aufteilung der Problemlosungskompetenz ergeben sich verschiedene Moglichkeiten zur Dekom- 
position der Problemstellung, die sich fiir die Umsetzung in einem Agentensystem eignen. Falk, 
Gomber et al., Sandholm und Fischer et al. wahlen die nachstliegende Dezentralisierungsvari- 
ante, indem sie jedem selbstandig disponierten Fuhrpark einen Agenten zuordnen. Entsprechend 
ergeben sich je nach Szenario Standort-, Niederlassungs-, Unternehmens- bzw. Speditionsagenten. 
Die ersten drei der genannten Ansatze verwenden nur diesen einen Agententyp, der fiir die ihm 
zugeordneten Fahrzeuge ein erweitertes Tourenplanungsproblem mit heuristischen Verfahren lost. 
Demgegeniiber sehen Fischer et al. aufierdem fiir jedes Fahrzeug einen eigenen Agenten vor, der 
die Planung seiner Route eigenstandig vornimmt, wahrend der Speditionsagent die dezentral ermit- 
telten Plane der ihm zugeordneten Fahrzeuge koordiniert und von Zeit zu Zeit eine zentrale Nachop- 
timierung vornimmt. Da Utecht sein Verfahren nur fiir den Fall eines einzelnen Unternehmens 
konkretisiert, entfallt bei ihm der Speditionsagent. Ahnlich wie Fischer et al. verwendet er 
Fahrzeug- Agenten, die einen lokalen Optimierer zur Losung von Traveling-Salesman-Problemen be- 
sitzen. Zusatzlich jedoch wird jeder Auftrag durch einen eigenen Agenten reprasentiert, der sich nach 
einfachen Regeln um eine Zuordnung zu einem Fahrzeug- Agenten bemiiht. 

In alien genannten Ansatzen kommen zur Koordination der Agentenaktivitaten Verfahren zum Ein- 
satz, die sich an marktliche Koordinationsmechanismen anlehnen. Das von Utecht als ’’bilate- 
rale Verhandlungen” bezeichnete Rahmenkonzept beruht auf dem Prinzip der direkten Suche in 
Markten: Jeder Auftragsagent sucht nacheinander mehrere Fahrzeugagenten auf und sammelt In- 
formationen iiber die Marginalkosten alternativer Zuordnungen. Schliefilich versucht er, sich an den 
Fahrzeugagenten zu binden, dessen Angebot die geringsten Kosten aufwies. Dieser kann den Ein- 
planungswunsch ablehnen, wenn er sein urspriingliches Angebot aufgrund mittlerweile vollzogener 
Planungsanderungen nicht mehr aufrechterhalten kann. Eine erfolgte Zuordnung ist nur temporar, da 
die Fahrzeug- Agenten regelmafiig einzelne Auftrags- Agenten wieder ausgliedern, die daraufhin einen 
neuen Suchprozefi starten miissen. Zu bestimmten Zeitpunkten, in denen alle Auftrags- Agenten 
gebunden sind, werden Schnappschiisse des Gesamtsystems und die Gesamt kosten ermittelt. 

In den MAS von Falk, Fischer et al., Gomber et al. und Sandholm kommen Koordi- 
nationsmechanismen zum Einsatz, die die Metapher einer Auktion bzw. Ausschreibung unter den 
Agenten verwenden, z.T. unter Berufung auf das sogenannte Kontraktnetzprotokoll [13]. Der Ablauf 
einer Verhandlungsrunde ist dabei im wesentlichen derselbe: Ein als Koordinator fungierender Agent 
schreibt einen Transportauftrag an seine Partner- Agenten aus, wobei der Koordinator durchaus auch 
selbst an der Verhandlung teilnehmen kann. Die Partner fiihren unter Beriicksichtigung ihrer au- 
genblicklichen lokalen Planungssituation eine individuelle, heuristische Bewertung dieses Auftrages 
durch und geben auf der Grundlage dieser Bewertung ein Angebot iiber die Einplanung dieses Auf- 
trages ab. Den Zuschlag erhalt der bestbietende Agent; dieser iibernimmt daraufhin den Auftrag in 
seinen Plan. Alle anderen Agenten erhalten eine Absage. 

Die genannten MAS realisieren jeweils unterschiedliche Konkretisierungen dieses Verhandlungssche- 
mas. Unterschiede zwischen den Verfahren betreffen z.B. die Bewertung der Auftrage nach dem 
Marginalkostenprinzip (Fischer ET AL., Sandholm) oder mit dem auftragsbezogenen, dispositions- 
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spezifischen Deckungsbeitrag (GOMBER ET AL., Falk); die Definition verschiedener Strategien zur 
Auswahl auszuschreibender Auftrage und die Durchfiihrung von Neuausschreibungen bereits verhan- 
delter Auftrage (z.B. Sandholm); die Moglichkeit zur Initiierung von weiteren Verhandlungsrunden 
durch Partner- Agenten wahrend einer Haupt-Verhandlungsrunde (Falk); die Erweiterung des Ver- 
handlungsschemas um komplexe Kontrakte, die jeweils mehrere Auftrage (Gomber et al., Sand- 
holm) und mehrere Agenten (Gomber et al.) gleichzeitig umfassen konnen sowie die Klarung der 
Frage, ob eine Verhandlungsrunde erst vollstandig abgeschlossen sein mufi (synchrone Ausfiihrung), 
bevor ein neuer Auftrag verhandelt werden kann, oder ob ein Agent in mehrere Verhandlungen gleich- 
zeitig involviert sein kann (asynchrone Ausfiihrung; vgl. z.B. Sandholm). 

Bei Fischer et al. findet das Kontraktnetz nur innerhalb einer Spedition Anwendung. Um 
Verhandlungen zwischen Speditionsagenten zu ermoglichen, beschreiben Fischer et al. die Ein- 
richtung einer ” Auftragsborse” , die von den Speditionsagenten zur Bekanntmachung von Verkaufs- 
wiinschen einzelner Auftrage genutzt werden kann. Signalisiert ein anderer Speditionsagent Interesse 
am Kauf eines Auftrags, treten beide Speditionsagenten in einen bilateralen Verhandlungsprozefi ein, 
in dessen Verlauf sie ihre Preisvorstellungen durch mehrfaches Gebot und Gegengebot zur Deckung 
zu bringen versuchen. Eine individuelle Bietfunktion bestimmt die Veranderung des jeweils nachsten 
Gebots gegeniiber dem vorangehenden. 

4.2 Eignung der Ansatze fiir das Groupage-Problem 

Zunachst ist festzustellen, dafi keiner der Ansatze das speditionelle Dispositionsproblem in der hier 
beschriebenen Form zugrundelegt, da auf einzelwirtschaftlicher Ebene ausschliefilich der Fall des 
Selbsteintritts, nicht aber das Problem der Sendungsgestaltung bei Fremdvergabe Beriicksichtigung 
findet. Dennoch sind die verwendeten Koordinationsmechanismen wegen ihrer Entkoppelung von 
der lokalen Problemstellung grundsatzlich auch auf das Groupage-Problem anwendbar. 

Um das fiir die Groupage charakteristische Eigeninteresse der Speditionen in einem MAS abbilden zu 
konnen, ist es erforderlich, neben den Zuordnungsentscheidungen auch die daran gekniipften Zahlun- 
gen zwischen den an einem Kontrakt beteiligten Agenten durch den Koordinationsmechanismus 
festzulegen und somit die Verteilung des Groupage- Erfolgs moglichst fair zu gestalten. Gegeniiber 
den iibrigen Ansatzen zeichnen sich nur die MAS von Fischer et al., Gomber et al. und Sand- 
holm durch die explizite Beriicksichtigung solcher Zahlungen aus. 

Bei Fischer et al. zahlt der Kaufer eines Auftrags den vereinbarten Kaufpreis an den Verkaufer; 
der Anspruch auf die Frachteinnahmen geht auf den Kaufer fiber. Der Verhandlungsmechanismus 
stellt dabei sicher, dafi beide Parteien einen positiven Nutzen erzielen. Bei Sandholm gibt der 
Koordinator mit der Ausschreibung einen maximalen Betrag an, den er an den Partner als Entgelt 
fiir die Ubernahme des Auftrags zu zahlen bereit ist. In seinem Angebot gibt jeder Bieter analog 
einen minimalen Betrag an, den er vom Koordinator fiir die Ubernahme des Auftrags fordert. Bei 
Zuschlag erfolgt die Zahlung in Hohe des Mittelwerts beider Betrage, so dafi beiden Agenten ein 
Gewinn in Hohe der Halfte der Differenz bleibt. Im Gegensatz dazu nimmt der Ansatz von Falk 
bei der Auftragszuordnung keine Rficksicht auf die Eigeninteressen der Niederlassungen. 

Sowohl bei Fischer et al. als auch bei Sandholm ist es moglich, durch strategische Gebotsab- 
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gabe den individuellen Gewinn zu Lasten des Vertragspartners zu erhohen. Um dies zu vermeiden, 
verwenden GoMBER et al. Koordinationsmechanismen auf Basis der Vickrey- Auktion, bei der die 
dominante Strategie in der Angabe wahrheitsgemafier Bewertungen besteht. 

Im Hinblick auf das Informationsschutzinteresse der Speditionen erscheint es nicht ratsam, Dek- 
kungsbeitrage zwischen den Speditionen auszutauschen, da auf diese Weise implizit Informationen 
iiber den mit dem Auftrag verbundenen Erlose offengelegt werden. Die Verwendung von Deckungs- 
beitragen in den Ansatzen von Falk und Gomber ET AL. ist vermutlich auf das beiden Ansatzen 
zugrundeliegende Szenario einer Konzernspedition zuriickzufiihren, deren Niederlassungen ein nicht 
so stark ausgepragtes Informationsschutzbediirfnis im gegenseitigen Verhaltnis haben. 

Ein weiterer wichtiger Aspekt sind die mit den Koordinationsmechanismen verbundenen Transak- 
tionskosten, die mafigeblich durch die Anzahl der erforderlichen Kommunikationsvorgange bis zur 
vollstandigen Abwicklung eines Leistungsaustausches bestimmt werden. Hier erweist sich das Bor- 
senkonzept von Fischer et al. als nachteilig, da eine unbestimmte Anzahl von Geboten und 
Gegengeboten bis zur Preisfindung zwischen den Parteien kommuniziert werden mufi. Ebenfalls 
erhohter Kommunikationsbedarf wird durch die zweistufige Agentenarchitektur desselben Ansatzes 
verursacht, da vor jeder Transaktion zwischen Speditionen aufwendige speditionsinterne Verhand- 
lungen mit den Fahrzeugagenten gefiihrt werden miissen. 

Eine konsequente Weiterfiihrung des Rahmenkonzeptes der bilateralen Verhandlungen nach Utecht 
konnte darin bestehen, jedes Transportmittel der Kooperation (eigene oder Frachtfiihrer-Fahrzeuge) 
als eigenstandigen Agenten zu modellieren. Die Auftragsagenten treten so - unter Verzicht auf eine 
Zwischenstufe aus Speditionsagenten - direkt mit den Transportmittel- Agenten in Verhandlungen. 
Dem Informationsschutzinteresse der Speditionen kann bei einer solchen Losung durchaus Rech- 
nung getragen werden. Um das wirtschaftliche Eigeninteresse der akquirierenden Speditionen zu 
sichern, miiBten sowohl die Transportmittel- Agenten einer Spedition als auch die Agenten der von 
einer Spedition akquirierten Auftrage diese Eigeninteressen in den Verhandlungen vertreten. Prob- 
lematisch erscheint hier wiederum die kombinatorische Explosion der Kommunikationsbeziehungen 
in einem solchen System bei einer Vielzahl von Auftragen und Transport mitt ein. 



5. Ausblick 

In der Praxis herrscht reges Interesse an der Entwicklung von Groupage-Systemen. Es wurde gezeigt, 
dafi MAS grundsatzlich gut als Grundmodell zur Abbildung des Groupage-Problems geeignet sind. 
Einige der in den diskutierten Literaturansatzen enthaltenen Teilkonzepte sind auf die Groupage 
iiber tragbar. Die Entwicklung mufi in zwei Richt ungen vorangetrieben werden: Zum einen gilt 
es, in den Verfahren auf lokaler Ebene die Besonderheiten des speditionellen Dispositionsproblems, 
insbesondere in bezug auf die Fremdvergabe, zu beriicksichtigen. Zum anderen sind die Koordi- 
nationsmechanismen so zu erweitern, dafi sie nicht nur der global effizienten Allokation von Trans- 
port auftragen dienen, sondern die individuelle Rationalitat einer Groupage-Transaktion durch einen 
fairen Ausgleich des transaktionsbezogenen Groupage-Erfolgs gewahrleisten. 
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Summary: We introduce a new binary integer linear programming model to formulate a generalised 
version of the sensor location problem, that is, to locate a minimal-cost set of counting points in 
order to derive the complete traffic flow vector on non-simmetric directed transport networks. We 
deflne a pair of heuristic algorithms that determine tight lower and upper bounds on the number of 
traffic sensors. Experimental results on a set of randomly generated test problems show that, even 
on medium-large size networks, estimated O/D matrices are closer to the true ones when obtained 
by the newly proposed sensor location strategies than by the common method, or, conversely, such 
strategies are more efficient (i.e., they require a lower measurement cost) than the common method 
to attain a given reliability level for the estimates. Preliminary simulation results on the real-world 
case of Salerno (an Italian medium-large town) are also presented. 

1. Introduction 

Conventional transportation models describe traffic routes by means of Origin/Destination (O/D) 
trip matrices and behavioral assumptions on road users. The primary objective of this work is to 
merge the theoretical foundations of such models with the apparently conflicting viewpoint of a 
system-oriented model that ignores drivers’ route choice and moves through the network according 
to a node-based logic, which balances traffic flows so that continuity is always satisfled. With respect 
to conventional models, the process of network flow derivation is reversed. In fact, traffic flows on a 
subset of observed links are generally used to estimate the O/D matrix, while the other undetected 
flows are subsequently derived by assignment schemes (see, e.g., [8]). On the contrary, we first derive 
traffic flows on all network links based on the subvector of measured ones and then produce an 
O/D estimate, so that estimated flows should be consistent with observed flows for the complete set 
of network links rather than just for the subset of measured ones. Thus, the crucial point of the 
proposed approach is determining how many sensors to locate and where, in view of deriving the 
complete set of traffic flows on a transport network at minimum cost. 

In this framework, we solved the sensor location problem (SLP) by a number of greedy heuristics 
that determine lower and upper bounds on the number of traffic sensors. We performed numerical 
simulations on a set of randomly generated symmetric transport networks, where the developed 
heuristics achieve good performances [1]. Then, we showed that solving SLP improves the accuracy 
of common O/D estimation procedures when we deal with small-medium size networks and/or as far 
as the a priori available O/D matrix is outdated [3]. 

In this paper, we extend the proposed approach along the following lines. On the theoretical side, 
we first generalise the model in order to deal with additional classes of transport networks. In 
particular, we consider non symmetric directed graphs where each node pair is connected either 
by a single link or by two links oriented in the opposite directions. Second, we refine the binary 
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integer linear programming formulation of SLR Third, we improve the performances of the heuristic 
solution algorithms by introducing new node selection, tie-breaking and local search rules. On the 
experimental side, the newly proposed sensor location strategies allow us to enhance the reliability 
of the estimated 0/D matrix with respect to common methods, even on medium-large size networks. 
Moreover, we analyze how these strategies compare with the common methods on the real-world case 
of Salerno, an Italian medium-size town. 

This paper is organized as follows. In Section 2 we formulate the sensor location problem on non- 
symmetric directed graphs. Section 3 presents the mathematical model and the heuristic solution 
algorithms for SLR Section 4 analyzes the implications on 0/D matrix estimation, both on a set of 
simulation experiments and on the real-world case of Salerno. Section 5 contains some concluding 
remarks. 

2. Problem formulation 

A transport network is represented by a directed graph G = (V,E). Link € E joins a start- 
node u with an end-node v. Let Th measure total traffic flow between the pair of centroid nodes h 
{h = 1, . . . ,n). Centroid nodes are the sources and the sinks of all network paths. Let S be the 
set of all centroid nodes and Ssi C S the set of all sinks. Let peh > 0 V(e, h) denote link choice 
proportions, that is, the shares of using the different links e e E (for simplicity, subscripts uv 
have been dropped out from notation). Matrix T with elements Th is an 0/D trip matrix. 

Let IN{v) denote the set of ingoing links to v. Let OUT{v) denote the set of outgoing links from v, 
f(evw) traffic flow on the corresponding link in OUT(v) and F{v) the total incoming flow to for 
any v, w eV. For any node other than a centroid, the total incoming flow equals the total outgoing 
flow {flow conservation constraint). Total traffic flow entering any given node v can be distributed 
among the outgoing links from v by means of turning movement coefficients c(e„^y) relative to that 
node. These can be derived from the same 0/D matrix assignment model used for determining 
link choice proportions. Note however that the turning coefficients defined here do not keep track 
of the origin and/or destination of the related traffic flows, as it usually happens in the literature 
[7]. Turning coefficients are such that: i) c{ey^) > 0, J2evweouT{v) = G R; ii) F{v) share 

through link e^w € OUT{v) is equal to /(e^iy) = F{v) c(ev«,); iii) F{v) share through a given link 
e^yj G OUT{v) can be expressed as a function of F{v) share through any other link e^z ^ OUT{v) 
and the turning movement coefficients relative to the considered link pair: f{eyyj) = ^^c(eyy;). 

A node is measured if traffic flows on its incoming and outgoing links are collected by a sensor 
located on the node itself. A node is disclosed {completely disclosed) if traffic flows on its outgoing 
(incoming and outgoing) links can be derived without any sensor located on itself. The set of 
disclosed and completely disclosed nodes associated with a measured node v is the influence basin 
of V. Let M C y be the set of measured nodes and Adj{M) = {v G V : 3cyyj e E,w e M} be 
the set of adjacent nodes to measured ones. A combined cutset of G associated with M is the set 
Cm = {^vw e E : {v e M)U{w e M)U{v e Adj{M))} (e.g., in Figure 1, M={a,b}, while Cm is 
identified by the set of dashed links). Let us now introduce a method for obtaining the influence 
basin of a measured node. Assume that turning movement coefficient values are known. Given a 
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measured node u, outgoing flows from v itself and its adjacents are known, since either they are 
measured or can be directly derived from the measured ones according to the deflnition of turning 
movement coefficients. Now, it is possible to deflne a linear system in which unknown variables 





Figure 1: The combined cutset associated with M = {a, 6} 

are associated with one (and only one) outgoing flow from every node but v and any other node 
belonging to {Adj{{v}) U Ssi), while equations correspond to flow conservation constraints relative 
to every node of the network. Note that a further unknown variable has to be introduced for each 
centroid, since incoming and outgoing flows are not balanced for such nodes. When the number of 
unknown variables is equal to the number of equations, the system is uniquely solvable, so that all 
nodes are completely disclosed and included in the influence basin of v. Otherwise, when the number 
of unknown variables exceeds the number of equations, there are many possible solutions and further 
information must be gathered in order to disclose all nodes. In such a case, the influence basin of v 
includes only a proper subset of the network nodes, so that additional measurements are required. 
Formally, the flow conservation constraint system for G = (V, E) can be written as A • / + s = /c, 
where A is the \V\ x \E\ coefficient matrix, / is the flow vector, s is the balancing flow vector for 
centroids and k is a vector of known real constants. An interesting remark has to be made at 

this point. 

Property 1 (cumulative effect). The overall influence basin of a set of measured nodes M may 
include a number of disclosed nodes that are not contained in any basin associated with each single 
node of the set. 

The sensor location problem (SLP) consists in determining how many sensors to locate and where, 
in view of deriving the complete set of traffic flows on a transport network at minimum cost. It may 
seem reasonable to solve SLP by defining a set-covering problem on the influence basins of every single 
node and look for the minimal set of basins including all network nodes. Such a formulation, however, 
cannot take into account the synergic impact of measuring simultaneously a given subset of nodes 
(i.e., the cumulative effect). Therefore, it is both suitable and necessary to devise an alternative 
approach to the considered problem. For this purpose, let E = {cvyj e E : f{eyyj) is known}. 
Actually, when Cyy, G Cm, either f{eyy,) is measured or can be derived by the definition of turning 
movement coefficients. 

Property 2 If M CV is a set of measured nodes, then, for each e^u, G Cm, G E. 
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Given a transport network G = (V, E), let a collection of subnetworks Gj = (V^, i = 1 , . . . , g, be 
such that: i) ViU. . .UV^U. . .UV^ = V; ii) V-flV^- = 0 , Vi, j q}; iii) Ei = eE:v,we FJ, 

Vz 

Let E' = {cvty 6 £■ : V € Vi, u; € Vj, Vi ^ j € { 1 , . . . , 9}}. If Gi = (VJ, £'i) is a connected component 
of G, let Si = {v e {Vi n S)} be the set of centroids in Gi and Adji{M) = {u G (V^ fl Adj{M))} the 
set of adjacent nodes to M in Gj. Recall that A- f-{-s = k is the flow conservation constraint system 
relative to G. Then, the following lemma and proposition hold (for their proofs, see [2]). 

Lemma 1 If E’ = E j then A* • = A;*, Vi G { 1 , . . . ,g}. Thus, matrix A has a block structure, in 

the sense that the flow conservation constraint system relative to Gi is determined by the \Vi\ x \Ei\ 
coefficient matrix A', the flow vector /* = {f{evw),^vw ^ Ei}, the balancing flow vector s* for 
centroids in Gi and the vector of known real constants k* G 

Proposition 1 Given a set of measured nodes M, the combined cutset Cm decomposes G into q 
connected components. These components correspond to the distinct subsystems A* • /* + s* = A:* 
(i = 1, . . . , ^) in which A - f s = k can be decomposed. 

Now, we are in position to state the following main result. 

Theorem 1 Given a combined cutset Cm of G and one of its associated connected components Gi, 
every node v ^ Vi is completely disclosed if and only if |5i\55/| < \Adji{M)\. 

Theorem 1 provides the theoretical basis which is needed to solve SLR Actually, the optimal solu- 
tion of SLP can be achieved if and only if Theorem 1 holds within every connected component G* 
associated with the combined cutset Cm of the transport network G. However, the configuration 
of these components is in turn directly dependent on the availability of a sensor location pattern 
that can only be obtained as a solution of SLP. This circularity prevents a manageable mathematical 
formulation of the considered problem, so that it is more reasonable to define a relaxed model that 
works at the general network level, thus disregarding possible decomposition structures. 

3. Mathematical model and solution algorithms 

Given the graph G = (V, E) representing the transport network, let S be the set of centroids and 
Ssi C S the set of sinks. Let [a^v] be the adjacency matrix. If a^v = 1 , then node u is adjacent to 
node V, that is, there exists link e„t,. Since the network is not symmetric, the fact that node u is 
adjacent to node v (i.e. a^v = 1) does not necessarily imply that node v is also adjacent to node u. 
Actually, it may be the case that a^u = 0 , namely, node v is not adjacent to node u. 

Let Xy = 1 if node v is measured {xy = 0 else) and y^y = 1 if nodes u and v are both measured 
iVuv = 0 else). Moreover, let kuv = 1 if node u is adjacent to node u, and v is measured, while u 
is not measured but is adjacent to at least another measured node in addition to v {kyy = 0 else). 
Finally, let ly = I if node v is not measured but is adjacent to at least a pair of measured nodes 
{ly = 0 else). 
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Then, the mathematical formulation of the relaxed sensor location problem consists in a binary 
integer linear programming (BILP) model that can be expressed as follows: 



iv^l 

TTlifl CyXff (l) 

v=l 

IV^I |V| 

+ /„ + 5Z[a„„(a:„ - - fc„„)]} > |S\5s/| (2) 

V=1 U=1 

yuv>Xv-^Xu-l Vv ^ u = 1, . . . , |V| (3) 

Vuv < — yv ^u = l,...,\V\ (4) 

> Jv + (1 - X„) + - 2 Vv#u = l,...,|F| (5) 

Kv < ^ 'iv i^u = \,...,\V\ (6) 

IV'I 

^uv — ^ ^ 0>uwXw ^ IX — 1, . . . , |V^| (7) 

w^v=l 

2^Wjtv=l ^VVJ 

Vt> = l,...,|F| (9) 

1 1^1 

^ 9 S Vw = 1, . . . , |y| (10) 

^ WJ^V=1 



where 5„ = 1 if i; is a centroid node (5^ = 0 else) and Cy is node v measurement cost. 

Objective function (1) simply requires to minimize the cost of measured nodes. Constraint (2) is a 
relaxed form of Theorem 1, stating that the number of adjacent nodes to M should be greater than 
the number of centroids but the sinks |5\55/|. The other constraints are essentially related to the 
formal definition of the involved variables and are intended to force such variables to their feasible 
values 0 or 1. In particular, variable y^y has been introduced to avoid that a measured node u that 
is adjacent to another measured node v be included among the adjacent nodes to v. Variable kyy has 
been introduced to prevent that a node that is not measured, say node ix, but is adjacent to at least 
a pair of measured nodes, be included among the adjacents to each of these nodes. Finally, variable 
ly assures that a node that is not measured, say node v, but is adjacent to at least a pair of measured 
nodes, be included among the adjacents to exactly one of these nodes. 

Solving the proposed model by an exact algorithm presents significant computational issues. In 
fact, such BILP problems are hard to solve (see e.g. [6]). Moreover, the optimal solution of the 
model is the actual optimum of SLP if and only if it is such that Theorem 1 turns out to be verified 
within each connected component of the network (note that this is trivially true if the network is not 
disconnected by the combined cutset Cm associated with M). 

Now, we define a pair of heuristic algorithms that determine lower and upper bounds on the number 
of sensors. The first one, heuristic LB, finds a lower bound for SLP. It requires to sort the nodes 
according to their non-increasing degree, choose the one with the highest number of adjacents and 
place a sensor on it. The node selection process is iteratively performed until the total degree of 
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measured nodes augmented of the size of set M is greater than (or at least equal to) the number of 
centroids in the network but the sinks. Generally, the combined cutset identified by M can disconnect 
the network in such a way that Theorem 1 does not hold within each connected component. Hence, 
the achieved solution is not feasible. The second heuristic algorithm finds an upper bound for SLP. 
This consists of the following steps. 

Heuristic UB 

Step 0. Initialize G = {V,E), N = V, M = d. 

Step 1. For each connected component where Theorem 1 does not hold, choose the node v which 
maximizes: = E?=iP/i, in which = 1 if p^h ^ 0 Ve € (IN(v) U OUT{v)) {peh = 0 else). Then, 

put it in tail of list M. Break the ties in favour of the node with the highest degree. 

Step 2. If Theorem 1 is satisfied within each connected component, a feasible solution is obtained; 
then STOP. Else, goto step 1. 

For each connected component where Theorem 1 does not hold, heuristic UB requires to measure the 
node V which is crossed by the largest number of 0/D paths. When Theorem 1 is satisfied within 
each connected component, a feasible solution is obtained. Figure 2 reports the computational 
performances achieved by the proposed heuristics. The obtained results significantly improve the 
performances of the algorithms developed in our previous work (for further details, see [1]). 
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Figure 2: Computational results 

4. Implications on O/D trip matrix estimation 

Once the vector of traffic flows in the whole network has been obtained, as final output of the SLP 
solving procedure, it is possible to estimate and successively update the O/D trip matrix. Actually, in 
this section the BILP model and the heuristic solution algorithms introduced for SLP are embedded 
within a comprehensive approach which consists of the following stages. 

Stage one. It derives traffic flows in the whole network, based on data collected by a properly 
dispersed minimal-cost set of counting points. 

Stage two. According to the results of stage one, it produces an accurate estimate of the O/D trip 
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matrix. 

Turning movement coefficients at each node are of primary importance in stage one, in order to 
extend traffic flows measured by road sensors over the whole network. On the other hand, link choice 
proportions are essential in stage two (together with the results of stage one) in order to estimate the 
0/D matrix. These stages are iteratively performed, so that the estimated matrix at a given period 
influences the process of network traffic flow derivation in the next period, that, in turn, allow the 
0/D matrix updating. In particular, by assigning the current 0/D matrix on the network we can 
update both turning movement coefficient values at each node and traffic flow proportions between 
each 0/D pair using the different links. As a consequence, we can extend traffic flows detected by 
road sensors in the next period over the whole network and then update the 0/D estimate. 

A common key objective of the variety of the proposed 0/D estimation models (for a comprehensive 
review, see [8]) is to guarantee the quality of the obtained results. Since the 0/D estimation error 
depends on the number and sites of observed links, it is essential to identify how many sensors to 
locate and where in order to achieve reliable estimates. Despite of the significance of the problem, 
to our knowledge the only sensor location criterion developed in the literature is the so-called “0/D 
Covering Rule” [9]. This rule states that, if the 0/D estimation error has to be bounded, counting 
points must be located on the network so that the trips between any 0/D pair are observed for at 
least a link of their path, or, in other words, a positive proportion of trips between each 0/D pair 
crosses at least one of the observed links. On the other hand, sensor location is crucial within the 
approach proposed in this paper. In fact, solving SLP allows to exploit a link flow knowledge base 
that is relative to the whole network. It follows that the 0/D estimation error is always bounded, 
given that the trips between each 0/D pair must cross at least one of the network links. Note that 
this happens even if the 0/D Covering Rule is not satisfled in a strict sense. 

Let us now investigate the implications of the sensor location pattern determined by solving SLP on 
the accuracy and reliability of 0/D matrix estimation procedures. In this respect, we have evaluated 
experimentally the performances of the common vs. proposed approach on a series of randomly 
generated test problems. The number and sites of traffic sensors in the test-networks have been 
selected by either solving SLP (in the case of the proposed approach) or satisfying the 0/D Covering 
Rule (in the case of the common method). On the other hand, the available link choice proportions 
and turning movement coefficients at each node have been obtained by assigning the prior 0/D 
matrix through a deterministic user equilibrium model (for further details, see [5]). 

It is assumed for simplicity that every link has the same measurement cost. Then, reliability assess- 
ments are made based on the same number of measured links. Generally, a number of measured links 
have resulted that almost doubles the number of centroids in any test-network (centroid nodes have 
always been located at the periphery of the networks, in a number that is of the order of the square 
root of the total number of nodes). Note that, in order to make an effective comparative analysis, it 
is assumed that measuring a node is the same as collecting traffic flows (i.e., placing a sensor) on each 
of its incident links, but the one which can be derived through the flow conservation constraint. In 
this framework, the Generalised Least Squares (GLS) estimator (see [4]) has been chosen to compare 
the 0/D estimation error associated with the proposed and common approaches. These have been 
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assessed by computing the following quantity: WMSE (%) = \ — — 7 ^ 100, that is, the 

)| 2^h=l^h 

weighted mean square error between the estimated 0/D matrix T and the true one T*, where the 
latter has been assumed known. 

Figure 3 shows the values taken by WMSE as a function of network size (where \V\ = 50, 100, 200, 400) 
and the a priori 0/D matrix or, more exactly, the distance D between such a matrix and the supposed 
true one (where D = d=10%,D = ±20%, £> = ±35%). Computational results reported in Figure 3 
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Figure 3: Reliability of estimated matrices 

are average values over 5 randomly generated networks. Except for a single case (where we have 
large networks and a rather outdated prior matrix), the following conclusions can be drawn from the 
presented figure. 

Fact 1 When traffic sensors are located by solving SLP, the O/D estimation error is lower than the 
common methods, measurement costs being equal. 

Fact 2 The number of sensors determined by solving SLP is less than or, at most, equal to the 
common methods, O/D estimation error being equal. 

It follows that estimated matrices are closer to the true ones when obtained by the proposed approach 
than by the common method, or, conversely, the proposed approach is more efficient (i.e., it requires a 
lower measurement cost) than the common method to attain a given reliability level for the estimates. 
Now, we analyse 

the real-world case of the city of Salerno, a medium-large size town located in southern Italy (see 
Figure 4). Its transport network can be represented by an asymmetric directed graph that includes 
555 nodes (among which there are 62 centroids) and 1133 links. Simulation results indicate that the 
achieved performances are still not completely satisfactory. According to the WMSE index, when 
traffic sensors are located on the basis of the O/D covering rule, the O/D matrix estimate is more 
accurate than the one obtained by locating sensors on the basis of the solution of SLP (WMSE = 
112.67 vs. 130.10). This happens despite of the fact that the O/D covering rule requires a lower 
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Figure 4: The city of Salerno 



number of links to be measured (83 vs. 99). Let us introduce a different performance index, defined 

T* -Th 

as WRE = — — 7 ^ • 100. According to WRE, our approach achieves a better performance (5.44 
VS. 5.78). It follows that the superiority of either method cannot be stated unambiguously. However, 
it is worth noting that the performance of our approach could be remarkably improved by developing 
some ad-hoc sensor location strategies which are able to take into account the particular structure 
of the considered transport network. 



5. Concluding remarks 

Origin/Destination (0/D) trip matrix estimation procedures generally rely on traffic flow information 
collected at a number of measured links. In this paper, we introduced a new binary integer linear 
programming model to formulate a generalised version of the sensor location problem, that is, to 
locate a minimal-cost set of counting points in order to derive the complet traffic flow vector on 
non-simmetric directed transport networks. We defined a pair of heuristic algorithms that determine 
tight lower and upper bounds on the number of traffic sensors. Experimental results on a set of 
randomly generated test problems show that, even on medium-large size networks, estimated 0/D 
matrices are closer to the true ones when obtained by the newly proposed sensor location strategies 
than by the common method, or, conversely, such strategies are more efficient (i.e., they require a 
lower measurement cost) than the common method to attain a given reliability level for the estimates. 
Although encouraging, preliminary simulation results on the real-world case of Salerno (an Italian 
medium-large town) are more controversial. Actually, at the present state of research, the superiority 
of either method cannot be stated unambiguously. 
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AHP-gestiitzte Evaluation von Verkehrsprojekten 
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1. AHP-Basis 

In den letzten Jahren hat die AHP-Methode (Analytic Hierarchy Process) — insbesondere wegen 
ihrer vielfaltigen Einsatzmoglichkeiten in der Praxis — weltweit grofie Beachtung gefunden. AHP 
kann auch zur Evaluation von Verkehrsprojekten verwendet werden, wozu in der Literatur zahlreiche 
Berichte und ausfiihrlich kommentierte Fallstudien vorliegen. Diese werden zumeist nach dem AHP- 
Standardverfahren bearbeitet, zu dem auch leistungsfahige EDV-Programme verfiigbar sind (vgl. 
Saaty 1990; Saaty 1994; Buede 1992). 

1.1 AHP-Methodologie 

1.1.1 AHP-Standardverfahren 

Das AHP-Standardverfahren lauft in drei Phasen ab. Die erste dient zur Aufgabenumschreibung und 
Personab/Sachmittelzuordnung. Von zentraler Bedeutung ist die Aufgabenumschreibung. Sie dient 
zur Festlegung des allgemeinen Untersuchungsziels (goal) und zur generellen Beschreibung potentiell 
untersuchungsrelevanter Alternativen. 

Die zweite Phase betrifft den AHP-Einsatz und besteht in der definitiven Selektion der zu un- 
tersuchenden Alternativen (alternatives) sowie in der Festlegung der zu ihrer Evaluation vorgese- 
henen Kriterien, die iiblicherweise als Attribute (attributes) bezeichnet werden. Nach dem AHP- 
Standardverfahren sind die bei der Alternativenevaluation zu beriicksichtigenden Attribute — von 
einfachen Verhaltnissen abgesehen — hierarchisch anzuordnen. Generell sollte die Zahl der in eine 
Hierarchiestufe zu libernehmenden Attribute relativ eng (7 ± 2) beschrankt werden; dagegen besteht 
beziiglich der hierarchischen Tiefengliederung keine Begrenzung. 

Die dritte Arbeitsphase ist durch den besonderen — in der Bestimmung und Weiterverarbeitung 
der Attributsgewichte zum Ausdruck kommenden — Charakter des AHP-Ansatzes gepragt. Dabei 
ist zwischen lokaler und globaler Attributsgewichtung zu differenzieren. Beziiglich der ersteren gilt 
der Grundsatz, dafi hierarchisch gleich eingestufte Attribute in Bezug auf die Auspragung des ihnen 
unmittelbar vorgelagerten Ankerelements — unter Verwendung einer einfachen Grundskala (funda- 
mental/intensity scale, vgl. Saaty 1994, p. 73; Saaty 1996, p. 24) — paarweise zu vergleichen und die 
dabei erzielten Ergebnisse in einer speziellen Matrix weiter aufzubereiten sind. 

Bei n identisch verankerten Attributen ist die Erstellung einer (n, n)— Vergleichsmatrix (comparison 
matrix) auf der Basis von n(n — l)/2 Direktvergleichen vom Typ = Wi/wj erforderlich, wobei mit 
Wi resp. Wj die Wichtung der Attribute i resp. j in Bezug auf das ihnen gemeinsame Ankerelement 
angegeben wird. iiblicherweise werden die Werte aij,i < j, anhand von Direktvergleichen ermittelt. 
Die iibrigen Positionen der Evaluationsmatrix betreffen das als Vergleichsbasis benutzte Attribut 
(Basiselement) an = 1 oder ergeben sich als reziproke Werte aji = 1/a vorgangig durchgefiihrter 
Direktvergleiche. 
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Die Weiterbearbeitung der Evaluationsmatrix erfolgt in mehreren Schritten und fiihrt letztlich zur 
Berechnung der Attributsgewichte; vgl. Tabelle 1. Aufierdem sind nach dem AHP-Ansatz die ori- 
ginaren Evaluationsmatrizen auch einer Konsistenzpriifung zu unterziehen. CR-Werte (consistency 
ratios) iiber 0.1 lassen eine Uberpriifung der ay— Werte als angezeigt erscheinen. 



Tabelle 1. Attributsgewichts-Berechnung und Konsistenzpriifung 



1. Erstellen einer Evaluationsmatrix mit 

n(n - l)/2 direkten Attributspaarvergleichen 

/ 1 Oi2 ... flln \ 

l/Oi 2 1 ... 02n 

A= . 

\ l/Oi„ l/ 02 n ... 1 / 

und Spaltensummenberechnung, 
e=(l,l,...,l) 

e A = X; Oil . . • E au . . . 2 o*n 


2. Normierung von A auf Basis 1.00 

/ • 0.lnlYl^in \ 

012 / 1^011 ... 02 n/X)atn 

B — ^ 

\ a\nf X] an . . . Ann/ X) / 

und Zeilen- /Spaltensummenberechnung 
B = (sl,s2, ...,sn)^ 
e 3 ^ = n 


3. Eigenvektorberechnung, 
n Attributsgewichte 

ty = (u;i , u; 2 , . . . , Wnf = (si/n, S 2 /n, . . . , 5„/n)^ 


4. Kennwertberechnung fiir Konsistenzpriifung 
auf Basis A w = XmaxUJ 
A{wi,W 2 ,...,Wn)'^ = (vl,v2,...,t;n)^ 

Amo* = e{Vi/wi,V 2 /W 2 , . . . ,Vn/WnV (l/u) 

CI = (Xmax-n)/(n-l) 

CR = CI/Rn 


7i„-Werte nach Saaty /Forman; 

Ri = 0 . 00 ; Rz = 0.52; ^4 = 0.89; Ri = 1.10; Rt = 1.25; flr = 1.35; Rf = 1.40; Rg = 1.45; flip = 1.49 



Tabelle 2 zeigt die in der zur Attributsgewichtung liblicherweise verwendete (1— 9)— Skala (vgl. Weber 
1993, S. 88); aufgrund von Tabelle 3 kann die AHP-spezifische Eigenvektor-/Endgewichtsberechnung 
leicht nachvollzogen werden. 

Die gezeigte Berechnungsweise wird als Additionsverfahren bezeichnet. Die Bestimmung der At- 
tributsgewichte kann alternativ auch nach dem Multiplikationsverfahren — durch zeilenweisen Mul- 
tiplikation der n Elemente der Evaluationsmatrix mit nachfolgender Division durch die Spaltenzahl 
n — erfolgen; das Verfahren wird aber selten verwendet. 



Tabelle 2, (l-9)-Skala 



Werte 


aij = Wi/wj 


Umschreibung: Gewicht des Basis-(Zeilen)Elements i, u/j, in 
Relation zum Gewicht des Vergleichs-(Spalten) Elements j, u/j, j = l(l)n 


1 




gleich 


3 


(1/3) 


etwas grofier (kleiner) 


5 


(1/5) 


wesentlich grofier (kleiner) 


7 


(1/7) 


viel grofier (kleiner) 


9 


(1/9) 


sehr viel grofier (kleiner) 


Verwendbar sind auch die Zwischenwerte 2, 4, 6, 8 (resp. 1/2, 1/4, 1/6, 1/8). 



Nach Berechnung aller lokalen Prioritaten (local priorities, L) auf samtlichen Hierarchiestufen er- 
folgt eine integrative Attributsgewichtung durch multiplikative Verkniipfung der Stufengewichte. 
Der Rechenprozefi lauft — ausgehend vom obersten Element (Ziel/goal) des Gesamtmodells oder 
separat zu bearbeitender Submodelle — sukzessive iiber alle Hierarchiestufen ab und fiihrt dort zur 
Ausweisung der entsprechenden Globalgewichte (global priorities, G). Fiir die G-Berechnung der 
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aktuell bearbeiteten Attribute gilt generell 



^aktuell — G Anker ' ^ aktuell • 



Die fiir alle Elemente der jeweils untersten Hierarchiestufe ausgewiesenen G-Werte gehen als Ba- 
sisfaktoren unmittelbar in die Alt ernativene valuation ein. Sie werden mit den fiir die einzelnen 
Alternativen — in Bezug auf die einzelnen Basisattribute ermittelten — Lokalgewichte multipliziert. 
Die Einzelergebnisse sind zu addieren; ihre Summe bildet die Grundlage zur Rangeinstufung der 
Alternativen. 



Tabelle 8. Eigenvektor-/Endgewichtsherechnung 



Zeilenattribute - 
Basis-/Spalten- 
attribut 4^ 



Attributsgewichte, 
origin^e Werte 



Attributsgewichte, 
normiert auf Basis 1.000 



Zeilen- 

summe 



Durch- 

schnitt/ 

Gewicht 



1 7 8 

1/7 1 2 

1/8 1/2 1 

1/9 1/4 1/4 



0.725 0.800 0.711 0.500 

0.104 0.114 0.178 0.222 

0.091 0.057 0.089 0.222 

0.080 0.029 0.022 0.056 



2.736 

0.618 

0.459 

0.187 



0.684 

0.155 

0.115 

0.046 



Spaltensumme 



1.38 8.75 11.25 18.00 1.000 1.000 1.000 1.000 



4.000 



1.000 



Ergebnisse von 4(4 - l)/2 = 6 Direktvergleichen, a»j = Wifwj, sind fett gedruckt. 
CR = 0.09 



Die Lokalgewichte der Alternativen konnen nach zwei verschiedenen Verfahren bestimmt werden. 
Die einfachere Methode (absolute measurement) entspricht einem Benotungsverfahren hinsichtlich 
der einzelnen Basisattribute. Grundsatzlich konnen die Alternativen aber auch — in Analogie zur 
Attributsgewichtung — anhand von paarweise durchzufiihrenden Vergleichen (relative measurement) 
bestimmt werden. Beziiglich des letzterwahnten Verfahrens ist zu beachten, dafi es sich bei einer 
grofieren Zahl Alternativen als recht aufwendig erweisen kann, sind doch bei m Alternativen und n 
Basisattributen insgesamt n(m[m — l]/2) Direktvergleiche erforderlich. Andererseits fallen bei nur 
zwei Alternativen lediglich n Vergleiche dieses Typs an, die zudem immer konsistent sind und somit 
keine C//Ci?-Berechnungen erforderlich machen. 



1.1.2 Varianten 

Neben dem — in der Praxis meistverwendeten — AHP-Standard verfahren bestehen auch AHP- 
Varianten unterschiedlichster Auspragung. Sie beinhalten alternativ verwendbare Grundskalen, die 
zahlenmafiige — beispielsweise logarithmische — Umsetzung und Weiterverrechnung der Vergleich- 
sergebnisse, die Weiterbearbeitung der Untersuchungsergebnisse anhand von Sensitivitatsanalysen, 
etc. Besonders hervorzuheben ist der ANP-Ansatz (Analytic Network Processing), der sich speziell 
zur Bearbeitung komplexer — horizontal und vertikal vernetzter — Modelle eignet (vgl. Saaty 1996). 




476 



2. AHP-Praxis 

Die mehrkriteriell orientierte Evaluation von Verkehrsprojekten bildete einen wichtigen Ausgangspunkt 
fiir die Entwicklung von AHP; zu verweisen ist auf die — volkswirtschaftlich orientierte — Sudan- 
Studie von Thomas L. Saaty. Zahlreiche Folgeuntersuchungen wurden in AHP-Forschungsberichten 
dargestellt; sie verdeut lichen, dafi die AHP-Methodologie zur Projektbeurteilung ex ante und ex post 
verwendbar ist und zudem Projekte mit/ohne Losungsvarianten betrefFen kann. Im Rahmen der fol- 
genden Ausfiihrungen kann lediglich auf eine eng begrenzte Auswahl derartiger Studien eingegangen 
werden. 

2.1 Projekte ohne Losungsvarianten 

2.1.1 Grundkonzepte 

Projekte ohne Losungsvarianten sind generell dadurch charakterisiert, dafi sie im Rahmen von Entschei- 
dungsprozessen lediglich zur Ausfiihrung akzeptiert oder abgelehnt werden konnen. Losungsvarian- 
ten, die im Rahmen von AHP-Studien zu untersuchen waren, bestehen nicht. Der AHP-Ansatz 
ist zunachst gut zur Evaluation von Projekten der vorerwahnten Art verwendbar. Dabei wird 
vorzugsweise mit zwei Submodellen zur Nutzen-/Kostenerfassung gearbeitet; deren Ergebnisse wer- 
den anhand eines N/K-Indexes verglichen und als Basis fiir die abschliefiende Ausarbeitung von 
Entscheidungs-Empfehlungen verwendet . 

2.1.2 Beispiele 

2. 1.2.1 Bau eines SchifFahrtkanals 

Ein in der Literatur ausfiihrlich beschriebenes Beispiel zur Kosten-/Nutzenanalyse betrifft das LEOR- 
Kanalbauprojekt (Lake Erie — Ohio River Canal). Das Originalmodell umfafit insgesamt 18 nutzen- 
und 21 kostenorientierte Kriterien, wobei jeweils mit vier Hauptkriterien zur Biindelung der okonomi- 
schen, sozialen, umweltbezogenen und politikorientierten Subkriterien gearbeitet wird. Die letzteren 
werden im Rahmen entsprechender Nutzen-/Kosten-Modelle bearbeitet; letztlich sind lediglich die 
Alternativen Kanalbau resp. Bauverzicht zu evaluieren (vgl. Rabbani/Rabbani 1996, p. 114 - 137). 

Tabelle 4- LEOR-Kanalprojekt: Nutzenmodell 

LEOR-Nutzenmodell 

Okonomische Nutzen-Kriterien 

Spezielle Wirtschaftsforderung, Bauphase: Beschaftigung Tiefbau 
Spezielle Wirtschaftsforderung, Betriebsphase: Beschaftigung Kanalverkehr 
Allgemeine Wirtschaftsforderung, Kanalregion 

SoziaJe Nutzen-Kriterien 

Schaffung neuer Erholungsgebiete 

Umweltbezogene Nutzen-Kriterien 

Politikorientierte Nutzen-Kriterien 

Vermehrtes nationzdes Interesse an LEOR Regionalproblemen 
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2. 1.2. 1.1 Nutzenmodell 

Die Grundstruktur des Nutzenmodells ist — verkiirzt — aus Tabelle 4 ersichtlich. 

Beziiglich der okonomischen Kriterien (OEKNU-K) werden sechs Subkriterien beriicksichtigt. Sie 
beziehen sich auf die mit dem Kanalbau und -betrieb verbundenen allgemeinen regionalwirtschaftlichen 
Auswirkungen, insbesondere hinsichtlich der Schaffung neuer — temporarer oder permanenter — Ar- 
beitsplatze. Weiterhin werden transportwirtschaftliche Vorteile, sowie Auswirkungen auf die Wasser- 
wirtschaft, etwa hinsichtlich eines verbesserten Hochwasserschutzes, als Subkriterien in das Nutzen- 
modell integriert. 

Die Gruppe der sozialen Kriterien (SOZNU-K) umfaBt lediglich vier Subkriterien. Mit ihnen wird die 
Schaffung neuer Arbeitsplatze resp. Erholungsgebiete, die Verminderung von Uberschwemmungsge- 
fahren durch die Ohio-Flufiregulierung sowie das grofiere nationale Interesse am LEOR-Wirtschafts- 
raum in die Nut zene valuation einbezogen. Hinsichtlich der Umwelt kriterien (UWENU-K) werden 
die Schaffung neuer Wildreservate und die Verbesserung des Umweltbewufitseins der Bevolkerung 
separat ausgewiesen. 

Im politischen Bereich (POLNU-K) erscheinen das grofiere nationale Interesse an der LEOR-Regio- 
nalpolitik und der erhohte Einflufi von Lokalbehorden auf politische Entscheidungen als Subkriterien. 



2. 1.2. 1.2 Kostenmodell 

Das Kostenmodell umfafit — vgl. Tabelle 5 — vier Hauptkriterien, die wiederum stark untergliedert 
sind. 

Tabelle 5. LEOR-Kanalprojekt: Kostenmodell 
LEOR-Kostenmodell 

Okonomische Kosten-Kriterien 
Kanalbaukosten 
Betriebs- und Unterhaltskosten 

Beschaftigungsriickgang lokaler Transportunternehmungen 

Soziale Kosten-Kriterien 

Verlust von Arbeitsplatzen 
Verlust von Erholungsgebieten 

Erhohte Umweltbelastung, geringere Attraktivitat der Region 

Umweltbezogene Kosten-Kriterien 

Beeintrachtigung der Luft-Wasserqualitat 

Politikorientierte Kosten-Kriterien 

Verschiebungen auf Arbeitsmarkt, Verlust angestammter Arbeitsplatze 



Beziiglich der okonomischen Faktoren (OEKKO-K) werden als Subkriterien die Baukosten sowie die 
Betriebs- und Unterhaltskosten separat in den Evaluationsprozefi einbezogen. Als okonomisch rele- 
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vante Kostenfaktoren gelten zudem: Beschaftigungsriickgang bei lokalen Transportunternehmungen, 
Verlagerung von Wohn- und Geschaftshausern, Ausfall von Steuern und Werteinbufien auf Liegen- 
schaften. Zudem werden die mit Neubau (Briicken) resp. Verlegung bestehender Verkehrswege ver- 
bunden Kosten im Kostenmodell separat ausgewiesen. 

Bei den sozialen Faktoren (SOZKO-K) sind — neben den mit der Verlagerung von Wohn- und 
Geschaftshausern, Priedhofen etc. verbundenen Nachteile — die durch den Kanalbau bedingte erhohte 
Umweltbelastung, der Verlust von Erholungsgebieten und angestammter Arbeitsplatze sowie die 
Kosten von Sozialprogrammen als Subkriterien in den EvaluationsprozeB einzubeziehen. 

Die sich auf umweltbedingte Kosten beziehenden Aspekte (UWEKO-K) betrelfen Auswirkungen des 
Kanalbaus auf Tierwelt, Luft- und Wasserqualitat sowie den Grundwasserspiegel. 

Unter den POLKO-K werden Umstrukturierungen auf dem Arbeitsmarkt, Kosten von Sozialpro- 
grammen und staatliche Regulierungen im Anschlufi an den Kanalbetrieb als wesentliche Subkriterien 
zur Projektevaluation ausgewiesen. 

2. 1.2. 1.3 Modellauswertung 

Die Modellauswertung erforderte den Einsatz eines leistungsfahigen AHP-Softwarepaketes (Expert 
Choice). Unter Beizug von Experten mufiten die sich auf das Nutzen- resp. Kostenmodell beziehenden 
acht Evaluationstableaus zunachst separat bearbeitet und die erzielten Ergebnisse anschliefiend zu 
N/K-Ratios zusammengefafit werden. 

Tabelle 6 zeigt — auszugsweise — die fiir das LEOR-Kanalprojekt berechneten Teil- und Gesamtergeb- 
nisse; sie fiihren aufgrund N/K-Ratios zu einer Bauverzichts-Empfehlung. 



Tabelle 6. LEOR-Kanalprojekt: Modellauswertung 



Nutzenmodell 

Projekt- Projekt- 
realisation verzicht 


Kostenmodell 

Projekt- Projekt- 

realisation verzicht 


Okonomische Nutzen-Kriterien 
OEKNU-1 .070 .003 


Okonomische Kosten-Kriterien 
OEKKO-1 .288 .003 


Soziale Nutzen-Kriterien 
SOZNU-1 .107 .003 


Soziale Kosten-Kriterien 
SOZKO-1 .055 .011 


Umweltbezogene Nutzen-Kriterien 
UWENU-1 .012 .014 


Umweltbezogene Kosten-Kriterien 
UMWEKO-1 .007 .001 


Politikorientierte N utzen-Kriterien 


Politikorientierte Kosten-Kriterien 


Total .913 .087 


.935 .067 


N/K-Ratio bei Projektrealisation 
•913/.935 = .976 


N/K-Ratio bei Projektverzicht 
.087/.067 = 1.298 
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2. 1.2.2 Geschwindigkeitsbegrenzungen auf US- Autobahnen 

Die auf amerikanischen Autobahnen mit einer Erhohung der zulassigen Maximalgeschwindigkeit von 
88 auf 105 km/h verbundenen Vor- und Nachteile wurden ebenfalls anhand eines AHP-Modells un- 
tersucht (Rabbani/Rabbani 1996, p. 138 - 145). Dabei wurden — zusatzlich zur Betrachtung primar 
verkehrswirtschaftlicher Aspekte, wie Verkehrsflufi (congestion) und Verkehrssicherheit (safety) — 
auch einzel- und gesamtwirtschaftliche sowie soziale Kriterien in die Analyse einbezogen. Die einzel- 
nen Kriterien wurden im Rahmen von zwei Submodellen — nutzen- resp. kostenorientiert — bear- 
beitet. Das auf insgesamt 16 Subkriterien basierende Gesamtmodell wurde 1994/95 von zwei ver- 
schiedenen Forschergruppen ausgewertet, wobei die zweite — unter Beriicksichtigung neuer Un- 
tersuchungen iiber die Unfallhaufigkeit und deren Auswirkungen (fatalities/injuries) — zu einer 
Befiirwortung der Geschwindigkeitserhohung (yes/no = 1.146/0.919) gelangte. Im Sommer 1995 
erfolgte im US-Bundesstaat Pennsylvania eine entsprechende Anpassung der Verkehrsvorschriften 
(Rabbani/Rabbani 1996, p. 145). 

Die Studie verdeutlicht die Bedeutung intensiver Basisstudien unter Beriicksichtigung neu anfall- 
ender Forschungsergebnisse. Aufierdem ist zu vermerken, dafi ein gut ausgebautes Grundmodell 
wiederholt — moglicherweise unter wesentlich verschiedenen Situationen — verwendet werden kann. 
Diesfalls kommt einer genauen Definition der verwendeten Kriterien und der Erstellung entsprechen- 
der Benutzungsrichtlinien — zur Sicherstellung eines intertemporal und interpersonal einheitlichen 
Vorgehens — zentrale Bedeutung zu. 

2.2 Projekte mit Losungsvarianten 

2.2.1 Grundkonzepte 

Im allgemeinen werden AHP-Modelle zur Beurteilung von Projekten mit mehreren — alternativ real- 
isierbaren — Losungsvarianten eingesetzt, wobei fiber die letzteren eine Rangordnung zu erstellen ist. 
Dabei kann auch ein — grundsatzlich immer moglicher — Projektverzicht explizit in die Analyse ein- 
bezogen werden. Bei derartigen Analysen ist generell darauf zu achten, dafi nur praktisch signifikante 
Losungsvarianten berficksichtigt werden, die zudem merklich voneinander abweichen. Die Zahl der 
in die Untersuchung einzubeziehenden Alternativen ist nach Mfiglichkeit abschliefiend festzulegen. 
Nachtragliche Veranderungen in der Zusammensetzung der Alternativen kann — bei zu wieder- 
holenden paarweisen Vergleichen (relative measurement /distributive mode) — zu Rangumstellungen 
ffihren. Dies etwa dann, wenn der Wegfall bisher getroffener Annahmen fiber Auftretenshaufigkeit 
Oder Kombination von Merkmalen eine wesentlich andere Einschatzung samtlicher Alternativen be- 
wirkt. Diese Situation tritt dann nicht auf, wenn der Alternativenvergleich an externen Benchmarks 
(absolute measurement) ausgerichtet wird oder in Bezug auf projektspezifisch bereits erreichte Best- 
werte (ideal/performance mode) erfolgt. 

2.2.2 Beispiele 

Die im folgenden erwahnten Beispiele orientieren fiber die bei der Evaluation unterschiedlicher 
Verkehrsprojekte mitberficksichtigten Kriterien und deren Integration in die generelle AHP-Modell- 
struktur; auf detaillierte Angaben zur stufenweise — computergestfitzt — durchgeffihrten Model- 
lauswertung wird verzichtet. 
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2.2.2. 1 Metro- Ausbau in Grofistadtregion 

Die Fallstudie zum Metro-Ausbau betrifft den Ausbau des Verkehrsnetzes im Grofiraum Washington. 
Den zustandigen Behorden (WMATA, Washington Metropolitan Area Transit Authority) standen 
zur Verbesserung der Verkehrsverbindung in die Vorstadte von Nord Virginia (Orange line, Vienna 
— Centreville) drei Alternativen zur Diskussion. Zu deren Evaluation konnte das aus Tabelle 7 
ersichtliche AHP-Modell benutzt werden (Saaty /Forman 1993, p. 279). 

Unter den okologischen Kriterien (ecological impacts) wurden insbesondere Auswirkungen eines 
Metro-Ausbaus auf Luft- und Wasserqualitat, Flora und Fauna, sowie Verkehrslarm in den Eval- 
uationsprozefi einbezogen. Beziiglich der Linienfiihrung und der damit verbundenen Bauten sollten 
Auswirkungen auf das allgemeine Landschaftsbild (visual and physical impacts of the system, in- 
cluding elements of line, slope, space and form, scenic resources, design of structures, and physical 
features) evaluiert werden. Unter den sozio-okonomischen und kulturellen Kriterien (socio-economic 
and cultural impacts) waren insbesondere auch langerfristig zu erwartende Auswirkungen des Metro- 
Ausbaus mitzuberiicksichtigen. 

Tabelle 7. Metro-Ausbauprojekt, WMATA. Grundstruktur 
Metro-Ausbau 

Okologische Kriterien. Auswirkungen auf 
Luftqualitat 
Wasserqualitat 

Verkehrsl^m /-erschiitterungen 
Technische Kriterien 

Sozio-okonomische und kulturelle Kriterien. Auswirkungen auf 
Regionale Entwicklung/Beschaftigung 
Strafienverkehr, inklusive Schaffiing von Parkplatzen 
Stadt- /Bevolkerungsentwicklung 

Alternativen: Keine Ver^derung; Netzausbau; Verbesserung im Zubringerdienst. 



2.2.2.2 Verkehrsflufiverbesserungen in einer stMtischen Agglomeration 
Im Grofiraum Washington stand auch der Bau eines neuen Autobahnrings (outer belt way) zur 
Diskussion, der — in Konkurrenz zu anderen Mafinahmen — zu einer Verbesserung des Verkehrs- 
flusses fiihren sollte. Das entsprechende AHP-Modell ist in seiner Grundstruktur aus Tabelle 8 
ersichtlich; besonders zu beachten ist die Differenzierung zwischen — zu maximierenden resp. min- 
imierenden — Subzielen (Saaty/Forman 1993, p. 280). 
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Tabelle 8. Verkehrsflufiverbesserungen in Ballungszentrum 



Globalziel : Verkehrsflufiverbesserung 

Subziel 1: Kostenminimierung (minimize the long and short term costs of the project) 

Landbeschaffung/Durchgangsrechte 
Baukosten 
Unterhaltskosten 

Subziel 2; Projektakzeptanz (maximize public acceptance for the project) 
Verkehrsteilnehmer 
Liegenschaftsbesitzer 

Subziel 3 Aktive Projektunterstiitzung (ensure maximum political support for the project) 
Interessengruppen 
Stimmbiirger 

Subziel 4: Projektfinanzierung (develop the funding alternative that is most fair) 

Staatssteuern 

Strafienbenutzungsgebiihren 
Steuerzuschlage auf TVeibstoffpreisen 
Anstofierbeitrage 

Subziel 5: Projektnutzen (maximize the positive economic impact on the area) 

Wirtschaftswachstum 

Kurzfristige Verbesserung der Verkehrsverh^tnisse (Staureduktion) 
L^gerfristige Verbesserung der Verkehrsverh^tnisse 



Alternativen: Forderung des Privatverkehrs (build outer beltway); Forderung des offentlichen Verkehrs (im- 
prove and expand rapid transit /existing Metro system); Begrenzung des regionalen Wirtschaftswachstums 
(limit growth and development in the area); Forderung flexibler Arbeitszeiten (encourage flex hours). 



3. Zusammenfassung und Ausblick 



Die angefiihrten Beispiele zeigen, dafi die AHP-Methode in mannigfaltiger Weise zur Evaluation 
von Verkehrsprojekten eingesetzt werden kann. Die Grundstruktur mancher Modelle kann zudem 
mit nur geringer Modifikation bei der Bearbeitung ahnlicher Projekte beibehalten werden. Generell 
ist festzuhalten, dafi die AHP-Methodologie zur Planung und Beurteilung grofier Verkehrsprojekte 
auf der Basis einer Vielzahl von Kriterien geeignet ist. Sie kann insbesondere auch zur Evaluation 
nationaler/internationaler Grofiprojekte im Bereich des Eisenbahn- und Strafienbaus benutzt werden, 
wobei auch die Moglichkeit zum Einbezug mehrerer Experten-/Interessengruppen besteht (vgl. Weber 
1993, S. 184 - 185). 

In Europa diirfte sich die AHP-Methode in Zukunft insbesondere im Rahmen von Untersuchun- 
gen der EU vermehrt durchsetzten; entsprechende Programme liegen bereits vor (vgl. deBrucker/ 
deWinne/Peeters/Verbeke/Winkelmans 1995, p. 265). Sehr zu empfehlen ware auch der AHP- 
Einsatz zur langerfristigen Planung des schweizerischen Verkehrsnetzes. 




482 



4. Literatur 

Buede, Dennis M.: Software Review. Three Packages for AHP: Criterium, Expert Choice and 
HIPRE+. Journal of Multi-Criteria Decisions Analysis 1(1992)2, p. 119 - 121. 

de Brucker, K./de Winne, N./Peeters, C./Verbeke, A./Winkelmans, W.: The Economic Evaluation of 
Public Investments in Transport Infrastructure: The Use of Multicriteria Analysis. International 
Journal of Transport Economics 22(1995)3, October, p. 261 - 181. 

Rabbani, Simin J./Rabbani, Soheil R.: Decisions in Transportation with the Analytic Hierarchy 
Process. Campina Grande: UFPB/CCT, 1996. 

Saaty, Thomas L.: The Sudan Transport Study. Interfaces 8(1977)1, Part 2, November, p.37 - 57. 

Saaty, Thomas L.: Multicriteria Decision Making. The Analytic Hierarchy Process. Planlling, 
Priority Setting, Resource Allocation. 2nd ed., Pittsburgh: RWS Publications, 1990. 

Saaty, Thomas L.: Fundamentals of Decision Making and Priority Theory with the Analytic Hierar- 
chy Process. Pittsburgh: RWW Publications, 1994. 

Saaty, Thomas L.: Decision Making with Dependence and Feedback. The Analytic Network Process. 
Pittsburgh: RWS, 1996. 

Saaty, Thomas L./Forman, Ernest H. (eds.): The Hierarchon. A Dictionary of Hierarchies. Pitts- 
burgh: RWS, 1993. 

Weber, Karl: Mehrkriterielle Entscheidungen. Miinchen: Oldenbourg, 1993. 

Weber, Karl: AHP-Analyse. Zeitschrift fiir Planung 6(1995)2, S. 185 - 195. 




Ein Simulated- Annealing- Verfahren fur das 
Standardproblem der Tourenplanung mit 
Zeitfensterrestriktionen 

Dipl.-Inform. Jorg Homberger, FernUniversitat Hagen 
Prof. Dr. Hermann Gehring, FernUniversitat Hagen 

Zusammenfassung: Der Beitrag stellt ein Simulated-Annealing- Verfahren zur Losung des um 

Zeitfensterrestriktionen erweiterten Standardproblems vor. Die Zielfunktion beriicksichtigt zwei Ziel- 
kriterien, die Minimierung der Fahrzeuganzahl als primares und die Minimierung der Gesamtentfer- 
nung als sekundares Kriterium. In der ersten Stufe eines zweistufigen Verfahrensansatzes wird daher 
zunachst - ohne Beriicksichtigung der Gesamtentfernung - ein Tourenplan mit minimaler Fahrzeug- 
anzahl berechnet. Von diesem Tourenplan ausgehend wird dann in der zweiten Stufe die von den 
Fahrzeugen zuriickzulegende Gesamtentfernung minimiert. Der Focus des Beit rags liegt auf den in 
der ersten Verfahrensstufe getroffenen problembezogenen Entwurfsentscheidungen, die insbesondere 
die verwendeten Nachbarschaftsstrukturen und die entwickelte Kostenfunktion betreffen. Zur Be- 
wertung des entwickelten Simulated- Annealing- Verfahrens werden einschlagige Benchmarkprobleme 
von Solomon [18] berechnet und die Ergebnisse mit den besten bekannten Losungen verglichen. 



1. Einfiihrung und Problemformulierung 

Im Rahmen der Tourenplanung werden Problemstellungen untersucht, bei denen Auslieferungs- oder 
Sammelauftrage zu effizienten Fahrzeugtouren zusammenzufassen sind. Nachfolgend wird das um 
Zeitfensterrestriktionen erweiterte Standardproblem der Tourenplanung betrachtet, das auch als ’’Ve- 
hicle Routing Problem with Time Windows” (VRPTW) bezeichnet wird. Das VRPTW lafit sich wie 
folgt beschreiben (vgl. [5]): 

Innerhalb einer Planungsperiode sind von einem Depot aus n Kunden mit Fahrzeugen derselben 
Fahrzeugkapazitat zu bedienen. Fiir jeden Kunden sind ein Bedarf, eine Bedienungsdauer und ein 
Zeitfenster gegeben. Der Bedarf eines Kunden ist durch genau eine Bedienung innerhalb des Kun- 
denzeitfensters zu decken. Jedes Fahrzeug darf innerhalb der Planungsperiode nur einmal eingesetzt 
werden. Der friiheste mdgliche Abfahrtzeitpunkt vom Depot sowie der spateste Zeitpunkt, zu dem 
ein Fahrzeug das Depot wieder erreicht haben mufi, sind ebenfalls vorgegeben. Die Standorte des De- 
pots und der Kunden sowie die kiirzesten Entfernungen und die entsprechenden Fahrzeiten zwischen 
den einzelnen Kunden und zwischen dem Depot und den Kunden seien als bekannt vorausgesetzt. 
Als Optimierungsziel wird in der Literatur in der Regel mit erster Prioritat die Minimierung der 
Fahrzeuganzahl und mit zweiter Prioritat die Minimierung der von den Fahrzeugen zuriickzulegenden 
Gesamtentfernung zugrundegelegt (vgl. [18]). 

Das VRPTW ist ein kombinatorisches Optimierungsproblem, das aufgrund seiner Komplexitat der 
Klasse der NP-schweren Problemen zuzuordnen ist (vgl. [9]). In den letzten Jahren konnten mit 
sogenannten ’’modernen heuristischen Suchverfahren” , wie z.B. Evolutionsstrategien (vgl. [7]) und 
Tabu Search (vgl. [19]), bereits gute Ergebnisse erzielt werden. 
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Im Rahmen dieses Beitrags wird ein Simulated- Annealing- Verfahren fiir das VRPTW vorgestellt. 
Der Fokus liegt auf der Beantwortung der Pragestellung, wie mit einem Nachbarschaftssuchver- 
fahren die Fahrzeuganzahl eines Tourenplans systematise!! reduziert werden kann. Zur Bewertung des 
entwickelten Simulated- Annealing- Verfahrens werden einschlagige Benchmarkprobleme von Solomon 
[18] berechnet und die Ergebnisse mit den besten bekannten Losungen verglichen. 

2 . Simulated- Annealing- Verfahren 

Der Ansatz, kombinatorische Optimierungsprobleme mit Simulated Annealing (SA) zu losen, wurde 
unabhangig voneinander von Kirkpatrick et al. [8] und Cerny [2] entwickelt. SA basiert auf der 
Grundlage des von Metropolis et al. [10] entwickelten Algorithmus zur Simulation der Abkiihlung 
von Festkorpern. 

Ausgehend von einer Startlosung wird im Rahmen des SA eine stochastische Nachbarschaftssuche 
durchgefiihrt. Zu diesem Zweek wird in jeder Iteration aus der Nachbarschaft N{s) der aktuellen 
Losung s eine Nachbarlosung s' € N{s) zufallig ausgewahlt. Die Entscheidung, ob s' als Folgelosung 
akzeptiert wird, d.h. die Suche von der ausgewahlten Nachbarlosung s' weitergefiihrt wird, hangt von 
den Kosten c{s) und c(s') der Losungen ab. Die Nachbarlosung s' wird in jedem Fall akzeptiert, falls 
sie eine Verbesserung gegeniiber s darstellt, d.h. niedrigere Kosten als s aufweist. Hingegen wird die 
Nachbarlosung s' nur mit einer Wahrscheinlichkeit akzeptiert, falls s' mit hoheren Kosten als s be- 
wertet wird. Die Wahrscheinlichkeit p = exp(— Ac/t) eine schlechtere Losung zu akzeptieren ist von 
der Kostendifferenz Ac = c(s') — c(s) und dem als ’’Temperatur” t bezeichneten Verfahrensparame- 
ter abhangig. Der Verfahrensparameter t wird wahrend der Suche entsprechend einem sogenannten 
’’Kiihlplan” variiert. Haufig wird eine geometrische Funktion a{t) = at, a < 1, zur Reduktion der 
Temperatur herangezogen. Die pro Temperatur durchzufiihrende Anzahl an Iterationen nrep wird 
gewohnlich in Abhangigkeit der Problemgofie festgelegt. 

Die Entwicklung eines Simulated- Annealing- Verfahrens fiir ein bestimmtes kombinatorisches Opti- 
mierungsproblem erfordert zum einen problemspezifische Entwurfsentscheidungen, insbesondere die 
Definition einer Nachbarschaftsstruktur und die Wahl einer geeigneten Kostenfunktion, sowie zum 
anderen generische Entwurfsentscheidungen wie z.B. die Festlegung der Starttemperatur to und die 
Wahl eines geeigneten Kiihlplans (vgl. [15]). 

3. Entwickeltes Simulated- Annealing- Verfahren fur das VRPTW 

Den meisten der in der Literatur diskutierten Losungsverfahren liegen Ansatze zugrunde, bei de- 
nen gleichzeitig die Fahrzeuganzahl und die von den Fahrzeugen zuriickzulegende Gesamtentfer- 
nung minimiert wird (vgl. [14], [19], [3]). Wie Retzko [16] zeigt, wachst die von den Fahrzeugen 
zuriickzulegende Gesamtentfernung, falls die Zahl der eingesetzten Fahrzeuge unter einen bestimmten 
Wert weiter reduziert wird. Aus diesem Grunde kann umgekehrt - ausgehend von einem gegebenen 
Tourenplan - die Suche nach einem Tourenplan mit einer geringeren Fahrzeuganzahl durch das gleich- 
zeitige Verfolgen des Ziels der Verringerung der Gesamtentfernung erschwert werden. Daher wird 
nachfolgend ein zweiphasiger Verfahrensansatz gewahlt. Wahrend in der ersten Phase zunachst ver- 
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sucht wild, ohne Beriicksichtigung der Gesamtentfernung einen zul^sigen Tourenplan mit einer mini- 
malen Fahrzeuganzahl zu berechnen, wird ausgehend von der in der ersten Phase berechneten Losung 
in der sich anschliefienden zweiten Phase die von den Fahrzeugen zuriickzulegende Gesamtentfernung 
minimiert. Den beiden Phasen des entwickelten Simulated-Annealing- Verfahrens liegen jeweils un- 
terschiedliche Entwurfsentscheidungen zugrunde. Im Rahmen dieses Beit rags wird im wesentlichen 
auf die problemspezifischen Entwurfsentscheidungen der ersten Phase eingegangen. 

3.1 Erste Verfahrensphase 

Zu Beginn des Verfahrens wird mit Hilfe des Savingsverfahren von Clarke und Wright [4] eine zu- 
lassige Startlosung fiir die nachfolgende Nachbarschaftssuche berechnet. 

Die der Nachbarschaftssuche zugrunde liegenden problembezogenen Entwurfsentscheidungen, ins- 
besondere in Bezug auf die verwendete Nachbarschaftsstruktur und die Kostenfunktion, sowie die 
durch den Kiihlplan festgelegten verfahrensbezogenen Entwurfsentscheidungen werden nun beschrie- 
ben. 

N achbarschaftsstruktur 

Fiir die iterative Berechnung von Nachbarlosungen wahrend der ersten Phase wird hier das kom- 
plexere Konzept des ’’Compound-Move” verwendet. Compound-Moves setzen sich aus verschiedenen 
einfachen Move-Operatoren zusammen. Nach Glover [6] und Osman [12] stellen sie ein besonders ef- 
fektives Konzept dar. Im gegebenen Fall werden zwei einfache Move-Operatoren in einen Compound- 
Move einbezogen: 

• Ein aus der Menge MoveSet = {Or-Opt-Move, 20pt*-Move, 1-Single-Interchange-Move} zufal- 
lig ausgewahlter Move-Operator. 

• Ein modifizierter Or-Opt-Move, der dem Ziel der Verringerung der Fahrzeuganzahl durch 
weitestgehende Auflosung einer Tour dient. 

Die Operatoren der Menge MoveSet gehen auf folgende Autoren zuriick: Dem Or-Opt-Move liegt 
ein Tauschkonzept von Or [11] zugrunde. Explizit unter der Bezeichnung ’’Move” fiihrten Osman 
[12] den 1-Single-Interchange-Move und Potvin et al. [13] den 20pt*-Move ein. 

Ausgehend von einem zul^sigen Tourenplan tp wird eine zulassige Nachbar losung tp" in zwei Schrit- 
ten berechnet. 

Im ersten Schritt wird mit Hilfe eines aus der Menge MoveSet zufallig gewahlten Move-Operators 
zunachst eine zul^sige und vorlaufige Nachbar losung tp' berechnet. Exemplarisch sei dieser Schritt 
nachfolgend fiir den Fall des Or-Opt-Move beschrieben. Ein Or-Opt-Move fafit eine Ausfiige- und 
eine Einfiigeoperation zusammen. Mittels dieser Operationen wird eine Sequenz von einem oder 
mehreren Kunden aus einer Tour t des Tourenplans tp ausgefiigt und anschliefiend an einer anderen 
S telle von tp wieder eingefiigt. Im Falle nur eines Kunden gilt: 

Ein Kunde i* wird aus der ihm bedienenden Tour t = (0, ..., 2 * ...,0) ausgefiigt, indem die 
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verbleibenden Kunden der Tour t unter Ausschlufi des Kunden i* in der urspriinglichen Reihenfolge 
bedient werden. Die um den Kunden i* reduzierte Tour t\ lautet dann: t\ = ( 0 , z!_, 2!;., 0 ). 
Handelt es sich bei der Tour t um eine Pendeltour, d.h. das Fahrzeug bedient ausschliefilich den 
Kunden i*, so wird der Kunde i* aus der Tour t ausgefiigt, indem die Tour t aufgelost wird. 

Bin ausgefiigter Kunde i* wird in eine Tour ^2 des Tourenplans eingefiigt, indem die Tour t2 nur 
die Bedienung des Kunden i* erweitert wird. Wird die Einfiigestelle durch zwei aufeinanderfolgende 
Orte {j,j+) bezeichnet, so gilt: ^2 = ( 0 ? .--jO). Durch das Einfiigen des Kunden i* ergibt sich 

die Tour = { 0 , Grundsatzlich ist es nicht ausgeschlossen, dafi ein Kunde wieder 

in die Tour eingefiigt wird, aus der er ausgefiigt wurde. Allerdings kann sich dann die Einfiigestelle 
von der Ausfiigestelle unterscheiden. Sowohl die Sequenz der auszufiigenden Kunden als auch die 
Einfiigeposition werden, unter Beriicksichtigung der Restriktionen, zufallig ausgewahlt. 

Im zweiten Schritt wird aus dem berechneten Tourenplan tp' mittels eines modifizierten Or-Opt- 
Operators die endgiiltige Nachbarlosung tp" berechnet. Der modifizierte Or-Opt-Operator zielt auf 
die Berechnung eines Tourenplans mit einer geringeren Fahrzeuganzahl ab. Zu diesem Zweck wird 
versucht, diejenige Tour tg € tp' des Tourenplans tp' weitestgehend aufzulosen, welche die geringste 
Anzahl von Kunden aufweist. tg wird nachfolgend auch als ’’kleinste” Tour bezeichnet. Zu diesem 
Zweck wird das Einfiigen von Kunden in andere Touren wie folgt prazisiert: Die Kunden der Tour 
tg werden entsprechend ihrer Bedienungsreihenfolge nacheinander in je eine Tour t ^ tg,t G tp' 
eingefiigt, sofern die Restriktionen des VRPTW nicht verletzt werden. Existieren fiir einen Kunden 
k etg alternative Einfiigestellen (2,2+) in anderen Touren, so sind zwei Falle zu unterscheiden: 

Falls ein Index 2+ einen Kunden bezeichnet, wird die durch das Einfiigen bewirkte Verschiebung 
der friihestmoglichen Abfahrtszeit bei dem Kunden 2+ als Entscheidungskriterium herangezogen. 
Falls ein Index 2+ das Depot bezeichnet, dient dagegen die durch das Einfiigen bewirkte Ver- 
schiebung der friihest moglichen Ankunftszeit am Depot 2-,. als Entscheidungskriterium. Existieren 
mehrere zul^sige Einfiigestellen mit gleicher minimaler Verschiebezeit, so wird diejenige Einfiigestelle 
gewahlt, fiir die die zusatzlich zu fahrende Strecke ein Minimum annimmt. Sofern kein Kunde der 
Tour tg wegen drohender Verletzung der Restriktionen in eine andere Tour t ^ tg,t e tp', eingefiigt 
werden kann, wird keine Veranderung an tp' vorgenommen. 

Kostenfunktion 

Die ’’Kosten” eines Tourenplans tp - im weiteren Sinne - konnen prinzipiell durch den Zielfunktions- 
wert, also in der ersten Phase durch die Fahrzeuganzahl cf{tp), ausgedriickt werden. In dem Fall, dafi 
zwei zu bewertende Tourenplane die gleiche Fahrzeuganzahl aufweisen, werden beide Tourenplane 
auch mit den gleichen Kosten bewertet. Eine Entscheidung, welche Suchrichtung zu praferieren ist, 
kann in diesem Fall nicht ohne weiteres getroffen werden. Aus diesem Grunde wurden zwei weitere, 
die Kosten beschreibende Kennzahlen gewahlt, welche die Aussicht auf den Erfolg der Einsparung 
eines weiteren Fahrzeugs mit einbeziehen. Auf diese Weise wird die Suchrichtung im Hinblick auf 
eine bessere Zielerreichung des primaren Zielkriteriums festgelegt. 
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Die beiden zusatzlich entwickelten Kennzahlen beziehen sich auf die kleinste Tour ts eines Touren- 
plans tp und schatzen auf unterschiedliche Art ab, wie leicht die Tour ts im Zuge der weiteren Suche 
vollstandig aufgelost werden kann. 

Die erste, einfache Kennzahl ck{tp) bezeichnet die Anzahl der Kunden, die in der Tour tg bedient 
werden. Unterstellt wird also, dafi ein kleiner Wert fiir ck{tp) die vollstandige Auflosung der Tour ts 
begiinstigt. 



Mit der zweiten Kennzahl cv{tp) wird die hier so bezeichnete ’’drohende Verletzung der Zeitfen- 
sterrestriktionen” berechnet. Mit der Kennzahl wird abgeschatzt, um wieviele Zeiteinheiten die 
Zeitfensterrestriktionen verletzt werden, falls man die Tour tg vollstandig auflosen wiirde. In dem in 
Abb. 1 dargestellten Beispiel wird abgeschatzt, um wieviele Zeiteinheiten die Zeitfensterrestriktio- 
nen verletzt werden, wenn der Kunde k = 4 der kleinsten Tour zwischen die Orte i = 2 und = 3 
eingefiigt wird. 




Geographische Darstellung einer Tour: 



J.egende: | 


■I 


zv, 1 




ZVj ; 


[ ] 


Zeitfenster 




Bedienungszeit 




Wartezeit 


— 


BedicnungsreLhenfolge 



Zeitliche Darstellung einer Tour: 




Nachdem die Kunden 1 und 2 innerhalb ihres Zeitfensters bedient wurden, erreicht das Fahrzeug 
den eingefiigten Kunden A: = 4 aufierhalb des Bedienungszeitfensters. Die Bedienung des eingefiigten 
Kunden wird in Bezug auf den spatest zul^sigen Beginn der Bedienung um die dargestellte Zeit 
zvi verzogert. Durch Riickwartsrechnung ausgehend vom Depot lafit sich die spateste Ankunftszeit 
sa bei dem Kunden i+ = 3 fiir den Fall berechnen, dafi das Fahrzeug gerade noch rechtzeitig zu 
dem Depot zuriickkehrt. Kommt das Fahrzeug jedoch, wie in Abb. 1 angenommen, erst nach dem 
Zeitpunkt sa an, so ergibt sich eine positive Zeitdifferenz, die in der Abb. 1 ebenfalls dargestellte Zeit- 
fensterverletzung zv 2 . Durch Addition der berechneten Zeitfensterverletzungen zvi und zv 2 erhalt 
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man fiir das Beispiel einen Wert fiir die drohende Verletzung der Zeitfensterrerstriktionen, falls man 
die kleinste Tour auflosen wiirde. 

Uber die Akzeptanz der durch einen Compound-Move berechneten Nachbarlosung tp" wird nun wie 
folgt entschieden: Sofern die berechnete Nachbarlosung tp" € N(tp) beziiglich einer der drei Kenn- 
zahlen c/, ck oder cv eine Verbesserung gegeniiber der aktuellen Losung tp darstellt, wird tp" als 
neue aktuelle Losung akzeptiert und die Suche von tp'* ausgehend fortgefiihrt. In dem Fall, dafi die 
Nachbarlosung tp" sowohl einen grofieren Wert ck als auch einen grofieren Wert cv aufweist, wird die 
Nachbarlosung nur mit der Wahrscheinlichkeit p = exp{—Acv/t) akzeptiert. Es sei erganzend be- 
merkt, dafi sich die Fahrzeuganzahl cf aufgrund der verwendeten Move-Operatoren nicht vergrofiern 
kann. 

Kiihlplan 

Was die generischen Entwurfsentscheidungen der ersten Phase anbelangt, sei bemerkt, dafi im wesent- 
lichen die Konzepte von Van Laarhoven und Aarts [21] angewandt werden. Wie einfiihrend bereits 
beschrieben, wird die Temperatur entsprechend der Funktion a(t) = at reduziert. Die fiir jede Tem- 
peratur durchzufiihrende Anzahl an Abkiihlungsschritten nrep{n) = bn wird in Abhangigkeit der 
Problemgrofie n festgelegt. Die Phase 1 wird abgebrochen, sobald eines der nachfolgenden Stoppkri- 
terien erfiillt ist: 

• Innerhalb von ks Kiihlschritten erfolgt keine Auflosung einer Tour. 

• Die Temperatur fallt unter den kritischen Wert tmin- 

• Die Auflosung einer weiteren Tour ist aufgrund der begrenzten Ladekapazitaten der Fahrzeuge 
nicht moglich. 

3.2 Zweite Verfahrensphase 

Ausgehend von dem in der ersten Verfahrensphase berechneten besten Tourenplan wird in der zweiten 
Verfahrensphase die Gesamtentfernung minimiert. Die zu diesem Zweck fiir die zweite Verfahrens- 
phase getroffenen Entwurfsentscheidungen werden nachfolgend kurz beschrieben. 

Im Unterschied zur ersten Verfahrensphase wird bei der Berechnung einer Nachbarlosung auf die 
Anwendung des modifizierten Or-Opt-Move verzichtet. In jeder Iteration wird lediglich ein aus der 
Menge MoveSet = {Or-Opt-Move, 20pt*-Move, 1-Single-Interchange-Move} zufallig ausgewahlter 
Move-Opertator angewendet. 

Die Kosten eines Tourenplans werden in der zweiten Phase durch die Gesamtentfernung ce{tp) aus- 
gedriickt. Eine berechnete Nachbarlbsung tp* wird auf jeden Fall akzeptiert, falls sie eine geringere 
Gesamtentfernung als tp aufweist, also ce{tp') < ce{tp) gilt. In dem Fall, dafi die Nachbarlosung tp* 
eine Verschlechterung gegeniiber tp darstellt, also Ace > 0 mit Ace = ce{tp') — ce{tp) gilt, wird tp' 
lediglich mit der Wahrscheinlichkeit p = exp(-Ace/t) akzeptiert. 

Die Temperatur wird zu Beginn der zweiten Phase neu initialisiert. Die Berechnung der neuen Start- 
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temperatur und deren Variation warden entsprechend den Konzepten von Van Laarhoven und Aarts 
[21] nach einem polynomialen Kiihlplan durchgefiihrt. 

4. Evaluierung 

Das entwickelte Verfahren wurde in C implementiert und einem Test unterzogen. Samtliche Pro- 
blemberechnungen wurden auf einem PC (Pentium Prozessor, 200 MHz) durchgefiihrt. Nachfolgend 
wird kurz auf die Festlegung der Verfahrensparameter eingegangen und die bei dem Test erzielte 
Losungsqualitat beschrieben. 

Im Rahmen einer durchgefiihrten Parameterstudie haben sich die in der folgenden Tab. 1 dargest ell- 
ten Parameterwerte als geeignet herausgestellt. Diese wurden fiir alle Berechnungen beibehalten. 



Tab. 1: Parameterwerte 



^0 


a 


± 


ks 


train 


1000 


0.9925 


0 


1000 


100 



Berechnet wurden die von Solomon [18] beschriebenen 56 Probleminstanzen, die in der Literatur 
liblicherweise zur Verfahrensbewertung herangezogen werden. Die Ergebnisse wurden mit den besten 
bekannten Losungen verglichen. Wie die Tab. 2 zeigt, konnte mit dem entwickelten Simulated- 
Annealing- Verfahren bzgl. des primaren Zielkriteriums ” Fahrzeuganzahl” die besten Werte berech- 
net werden. 



Tab. 2: Vergleich kumulierter Ergebnisse fiir die 56 Problembeispiele von Solomon [18] 



Referenzverfahren 


Kumulierte Fahrzeuganzahl 


Kumulierte Gesamtentfernung 


Homberger und Gehring (SA) 


405 


57616 


Homberger und Gehring [7] 


406 


58921 


Taillard et al. [19] 


410 


56694 


Bachem et al. [1] 


414 


61523 


Rochat und Taillard [17] 


415 


57231 


Thangiah et al. [20] 


419 


58951 


Potvin et al. [13] 


422 


62571 


Chiang und Russell [3] 


422 


65175 


Potvin und Bengio [14] 


426 


63530 



Ein Vergleich bzgl. des sekundaren Zielkriteriums ’’Gesamtentfernung” ist bei unterschiedlicher 
Fahrzeuganzahl nur bedingt moglich, da die Einsparung eines Fahrzeugs eine Erhohung der Fahr- 
strecke zwingend voraussetzen kann. Um die Losungsqualitat bzgl. der Gesamtentfernung besser 
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beurteilen zu konnen, wurde daher ein Vergleich mit den besten von Taillard [19] berechneten 
Losungen fiir die 51 Probleminstanzen durchgefiihrt, bei denen die gleiche Fahrzeuganzahl vorlag. 
Fiir die Probleminstanzen (aufier R104, RllO, R112, R207 und RC202) konnte mit dem Verfahren von 
Taillard [19] eine kumulierte Gesamtentfernung von 52496 Einheiten, mit dem Simulated- Annealing- 
Verfahren eine kumulierte Gesamtentfernung von 52149 Einheiten berechnet werden. Insgesamt ist 
zu bemerken, dafi fiir die in Tab. 4 aufgefiihrten Probleminstanzen neue Best losungen beziiglich der 
Gesamtentfernung berechnet wurden. 



Tab. 3: Neue Best losungen 



Probleminstanz 


Bestes Literaturergebnis (vgl. [19]) 


Beste berechnete Losung 


R107 




1126,69 


10 


1120,80 






1214,54 


11 


1194,73 


Rill 




1104,83 


10 


1099,46 




D 


1214,28 


3 


1198,40 




3 


951,59 


3 


946,11 


R204 


m 


941,76 


m 


854,84 




m 


1038,72 


m 


1013,47 




m 


932,47 


3 


913,65 




m 


959,47 


3 


914,12 




m 




3 


955,38 


R211 


2 


923,80 


2 


910,05 


RC203 


3 


1112,55 


3 


1052,09 


RC204 


3 


831,69 


3 


799,28 


RC205 


4 


1328,21 


4 


1320,82 


RC208 


3 


847,90 


3 


835,04 



Um die Robustheit des entwickelten Simulated- Annealing- Verfahrens in Bezug auf die Fahrzeugan- 
zahl zu ermitteln, wurde jede Probleminstanz fiinf mal berechnet. Jede der fiinf Berechnungen einer 
Probleminstanz wurde mit unterschiedlichen Zufallszahlen fiir die Operanden-Auswahl, die Akzep- 
tanz einer Nachbarlosung usw. durchgefiihrt. Im folgenden wird die einmalige Berechnung aller 
Probleminstanzen als Berechnungslauf bezeichnet. In der Tab. 4 werden fiir den besten und den 
schlechtesten der fiinf Berechnungslaufe die kumulierte Fahrzeuganzahl und die zur Ermittlung der 
besten Losungen benotigte mittlere Rechenzeit angegeben. Im Vergleich des besten mit dem schlech- 
testen Berechnungslauf fallt auf, dafi die berechneten kumulierten Werte fiir die Fahrzeuganzahl 
lediglich um zwei Fahrzeuge voneinander abweichen. In Bezug auf die mit dem besten Berech- 
nungslauf insgesamt erzielte Fahrzeuganzahl betragt die Abweichung somit lediglich 0.5%. Die erste 
Phase des Simulated Annealing kann somit als sehr robust bezeichnet werden. 

Die skizzierten Ergebnisse zeigen, dafi die hier im Vordergrund stehende erste Phase des entwickelten 
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Tab. 4: Vergleich kumulierter Ergebnisse fiir die 56 Problembeispiele von Solomon [18] 



Lauf 


Kumulierte Fahrzeuganzahl 


Mittlere Rechenzeit 


Bester Berechnungslauf 


406 


69 sec 


Schlechtester Berechnungslauf 


408 


41 sec 



Simulated- Annealing- Verfahrens gut geeignet ist, fiir Probleminstanzen des VRPTW Losungen mit 
einer geringen Fahrzeuganzahl zu berechnen. Dariiber hinaus kann die mit der zweiten Phase erzielte 
Gesamtentfernung als ordentlich bezeichnet werden, so dafi man insgesamt die zweiphasige Konzep- 
tion des Simulated- Annealing- Verfahrens als einen geeigneten Losungsansatz fiir Probleminstanzen 
des VRPTW bezeichnen kann. 
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A Genetic Algorithm Approach to a Terminus 
Re-Scheduling Problem 



S.J. Massey, Wheatstone Laboratory, King’s College London 
J.F. Boyce, Wheatstone Laboratory, King’s College London 

Summary: A genetic algorithm formulation is applied to the problem of re-routing trains through 
the arrival, storage and departure platforms of a terminus whenever the arrival times are subject to 
disruption. The method is applied to real data for a typical terminus and shown to achieve acceptable 
routings within feasible computing times. 

1. Statement of Problem 

The problem is that of generating a shunting sequence for a set of trains which are approaching 
the terminus such that the trains depart the terminus on time. The problem is well modelled as a 
constraint satisfaction optimisation problem. The nature of the constraints and the function to be 
optimised follow from an analysis of the rail operators’ requirements, the (mechanical) safety system 
and consideration of possible solution implementations. 

A terminus-shunting problem is more complicated than a line-management problem. In the latter 
problem the routes of the trains are defined by their published timetables. The details of the routes 
within particular areas of the rail network can be resolved as long as track capacities are taken into 
account by the rescheduling algorithm. However in the terminus-shunting problem it is required that 
all of the routes through the terminus that are taken by the trains be assigned by the algorithm. 
As trains on different routes interact strongly, the choice of routes is critical. A terminus-shunting 
problem is thus both a train routing and rescheduling problem. 

A diagram of a typical terminus is presented in Figure 1. It comprises eight nodes between which 
trains may move under the safety system. The entrance node. Track 1, represents the extended line 
of track to the next station. Trains arrive on this track under the control of a line-traffic control 
system, (central traffic manager). Once immediately outside the terminus control is passed to the 
terminus control system, (local trafl&c manager). The exit node. Track 2, represents the extended 
line of track to the next station on the line. Trains that are passed to this node are assumed to 
continue along the line under the control of the line-manager control system. The maximum number 
of trains that may be controlled by the terminus control system is seven. 

A train may take six direct routes through the terminus. These routes are defined in Table lin terms 
of route-sections as indicated by the arrows in Figure 1. A route section connects two terminus nodes 
and has an associated direction. In Figure 1 the route sections are defined by the solid arrows between 
nodes. 

A route is defined to be a sequence of route-sections through the terminus from the entrance node to 
the exit node in which passengers are unloaded and new passengers loaded. Routes may be classified 
as direct or indirect. A direct route is one that does not utilise a platform except for loading or 
unloading passengers, i.e. does not utilise it as a storage area. There are a number of additional 
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Figure 1: Routes and route sections 



Route 1 


- 


Track 1 - Platform 2 


Platform 2 - Track 2 


- 


Route 2 


Track 1 - Platform 1 


Platform 1 - Storage 1 


Storage 1 - Platform 2 


Platform 2 - Track 2 


Route 3 


Track 1 - Platform 1 


Platform 1 - Storage 2 


Storage 2 - Platform 2 


Platform 2 - Track 2 


Route 4 


Track 1 - Platform 1 


Platform 1 - Storage 2 


Storage 2 - Platform 3 


Platform 3 - Track 2 


Route 5 


Track 1 - Platform 1 


Platform 1 - Storage 3 


Storage 3 - Platform 2 


Platform 2 - Track 2 


Route 6 


Track 1 - Platform 1 


Platform 1 - Storage 3 


Storage 3 - Platform 3 


Platform 3 - Track 2 



Table 1: Direct routes through the terminus 



route-sections that might be used by a train in order to delay its departure; Storage 1 - Platform 1 
(the dashed arrow in Figure 1). Such a route-section would be part of an indirect route through the 
terminus. Indirect routes have been excluded in order to maintain uniformity of behaviour, and thus 
enhance safety. 

Route-sections are specified in Table l.in the context of defining the direct routes through the 
terminus. Every direct route through the terminus is composed of a number of the twelve direct 
route-sections. Some sections of track are common to more than one route-section. Trains on 
separate route-sections may collide on track-sections that form part of more than one route-section. 
The common track shared by route-sections greatly increases the complexity of the problem. Trains 
on separate routes impose delays on one another through their common track-sections. 

During a typical time window, 14 incoming trains are timetabled to arrive at the terminus separated 
by intervals of 120 to 150 seconds. If it is assumed that trains are assigned routes from only the set 
of direct routes through the terminus, then, as each of the incoming 14 trains may have assigned to 
it one of six routes, there are 6^^ , i.e. more than 7.8 x 10^°, possible route-assignments. Each of 
these route-assignments may itself be considered as a train-delay rescheduling optimisation problem 
with either 2 or 4 train-delay variables for each of the 14 incoming trains. 

The following constraints apply to the routes and schedules. 

1. Trains must not collide, and a safe headway must be maintained between departing trains. 



2. Trains must not depart the terminus before their scheduled departure times. 
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3. Subsequent to any disruption to the time-table upon arrival, the scheduled departure times 
should be recovered in a smooth manner. 

Train collisions are avoided within the terminus through a track reservation system which ‘locks’ 
tracks. When a train is about to move from one node of the terminus to another the necessary 
track is locked. The individual track sections that make up the route section between two nodes are 
unlocked as the train passes them. Collisions are thereby avoided. The train headway conflict is an 
expression of the minimum safe headway of 90 seconds between trains on the open line outside the 
terminus. A breach of the minimum headway constraint is considered to be as serious as a collision. 
As passengers will expect a train to be at the departure platform up to the last possible second, 
trains should not depart prior to their scheduled departure times. The above three requirements are 
hard constraints in that a valid timetable may not violate them. 

It is desirable for trains departing the terminus to have destinations that alternate between the two 
final destinations, so that the number of passengers does not build up on the platforms during peak 
periods. The destination alternation constraint pertains because each train is associated with a driver 
and each driver is associated with one route. This means that there are effectively two types of trains 
in the system, corresponding to the two alternative destinations. 

The requirement of having alternating types of departing trains may be stated as each train departing 
the terminus should be of a different type from the preceding. The destination alternation constraint 
is a soft constraint, since, in certain situations, it is necessary that this constraint be violated. A 
simple example is any shunting sequence that is in operation when one of the lines is blocked for an 
extended period due to an accident. Another valid violation of the behaviour would occur for the 
best solution when a train is cancelled in a uniformly alternating arrival sequence of on-time trains. 
In this case the best shunting solution is one which almost retains the arrival order as the departure 
order. At a local level it is undesirable for the shunting sequence to keep swapping the order of trains 
as they pass through the terminus to cause the trains to alternate perfectly, since this would impose 
additional delays. 

2. Movement Strategy 

2.1 Movement Rule 

It is necessary to define a strategy of movement for trains within the terminus. Experience of 
line-management scheduling [4] suggests that a good first approximation is to mimic the movement- 
strategy of a dispatcher, since it will be acceptable and because it is likely to be near-optimal. The 
chosen objective, therefore, is that each arriving train should be made available for departure as soon 
as is feasible. Trains attempt to advance through the terminus as rapidly as possible until they reach 
their departure platform. The trains wait there until the departure time scheduled in the time-table 
provided by the central traffic manager. If some track on the next route section of a train’s route is 
locked then the train waits until the track becomes free. 

It is conceivable that a better schedule might be obtained by a train waiting in a storage location 
to permit a delayed train to precede it over a shared track-section, however the implied increase of 
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algorithmic complexity offsets the gain during an infrequent event. 

2.2 Cost-Function 

A complete set of route-assignments for the trains that are to pass through the terminus within a 
given time-window, a route-configuration, is represented as a sequence of integers. Each location in 
the sequence corresponds to an arriving train, the associated integer representing the assigned route 
through the terminus. As time progresses, the time-window translates, integers may be removed 
from the beginning of the sequence and added to the end, so that the total length remains constant. 
In this manner the algorithm may continuously adapt a population of route-assignments to a time- 
developing terminus system. 

The cost associated with each route-assignment is the delay in departure of the corresponding train, 
the difference of its actual time of departure from the published time of departure. In general, costs 
associated with train-delays do not combine linearly: it is preferable to delay each of a number of 
trains by a slight amount rather than delay a single train by a large amount. However, since the 
train-movement strategy is to move each train as soon as possible, delays will be naturally distributed 
among trains, and so costs may be combined linearly: the cost of a route-configuration is the sum of 
the costs of the component route-assignments. 

2.3 A Genetic Algorithm Search Algorithm for Terminus Route Assignments 

Apart from their effective generality of search and natural implementation in distributed systems [2], 
the properties of genetic algorithms that make them attractive for the implementation of a terminus 
control system are; 

1. their ability to capture the characteristics that are peculiar to a given terminus, so that the 
system adapts to the particular terminus; 

2. their capacity to maintain a pool of good quality solutions from which candidate solutions 
may be evolved, covering disjoint regions of the search space, so that a long-term memory is 
established, with even infrequent disruptions to the time-table being recognised as having been 
encountered previously; 

3. their ability to evolve with time, so that background trends, such as a shift between rush-hour 
and normal or week-day and week-end time-tables can be accomodated. 

The chosen representation of a route-configuration, a sequence of integers, maps naturally onto the 
simplest form of chromosome of a genetic algorithm, a string, whose locations determine the genes, 
with values, chosen from a finite alphabet, determining the associated characteristics. In the terminus 
problem the trains are the genes; the routes are the characteristics. 

Although a binary representation using a three-bit code would be optimal [3], for simplicity of 
interpretation, we have chosen to represent the six direct routes by integers [0, . . . , 5]; hence a possible 
chromosome, representing the routes of ten trains, is 

A = {1 2 3 4 5 6 1 2 3 4). 
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For similar reasons we have utilised haploid rather than diploid chromosomes and have not investi- 
gated the effect of dominant/recessive genes. Following the notation of Goldberg [2], let there be a 
population A(t) = {Ai, z = 1, . . . N} of N chromosomes at generation t. The cost g (A) of a route- 
assignment is the sum of the delays that it would cause to all of the trains. It is dependent on the 
chosen train-movement rule. The problem is to find the global minimum of the cost. 

The fitness of the chromosome corresponding to a route-assignment is defined to be 



f{A) = 



1 



Optimum search may require that the measure of fitness be varied as a function of generation, since 
at the beginning of a search the population may be contaminated by a few highly unfit individuals, 
while towards the end, the population may become so homogeneous that the optimum fitness changes 
little between generations. Moves over the cost surface are generated via a standard set of operators 
used in genetic algorithms, the reproduction operator, the simple crossover operator and the random 
mutation operator. The size N of the population of chromosomes is held constant. 

During reproduction, chromosomes are chosen from the population A{t) to form the population 
A(t -h 1). A stochastic choice is made, with each chromosome of A{t + 1) being chosen from A(^) 
according to the probability distribution 

^ EA'eMt)\(A'Y 

Simple crossover has been implemented: pairs of chromosomes chosen at random and split at a 
randomly-chosen site. Two new chromosomes are then formed by swapping the pair of leading 
sections. A mutation replaces the value of a randomly-chosen gene of a randomly-chosen chromosome 
by a randomly-chosen value, i.e. a random change is made to a randomly-chosen route- assignment. 
Each chromosome maps to a point on the cost hyper-surface. The reproduction operator selects a 
sub-set of the points represented by the population A(t) to be written to the population A{t -h 1). 
The effect of reproduction is that, on average, points having lower cost values will be favoured. The 
crossover and mutation operators then scatter the algorithm’s attention from this subset of points. 
The effects of the reproduction operator may be described quantitatively, and the scattering strategy 
may be described qualitatively, using the concept of schemata [3]. A schema is a partial-specification 
of a chromosome where the values of a sub-set of the genes are defined, whilst the remainder are 
undefined. If the asterisk symbol is used to represent an undefined value, then an example schema is 



H = {* * 3 4 ♦ ♦ 1 ♦ ♦ *). 



The significant properties of a schema H are its order o{H)j the number of defined values, three 
for the above example, and its length, S(H), the number of genes between its first and last defined 
values, four for the above example. As shown by Goldberg [2], the behaviour of a genetic algorithm 
may be analysed using the Schema Theorem. Let m(i/, i -f 1) be the expected number of a schema 
H in generation t -f 1 given that there were t) instances in generation t and suppose that the 
fitness of H is /(If), then 
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where 






HeA{t) 



is the average fitness of schema H within population A(t). It follows that reproduction causes an 
exponential increase in the number of fit schemata, at the expense of unfit schemata. 

The effect of mutation is to prevent a population from becoming too homogeneous, so that a global 
optimum may be missed or that changes in the search environment may not be accommodated. 
The rates of crossover and mutation are both problem-dependent and environment dependent. An 
analysis of crossover [1] shows that its operation favours short schemata which contribute an above 
average fitness value. This implies that a route-assignment genetic algorithm would perform well if, 
on average, the exploitation of short schemata has a stronger effect than the random introduction 
of new long schemata. This characteristic bias of the crossover operator is well suited to the route 
assignment parameterisation of the terminus shunting problem. Consider a general route assignment 
configuration for a set of incoming trains, with separations of 120 to 150 seconds. The time required 
to complete a route section is 40 — 100 seconds with either an additional 15 seconds to load and 
unload passengers or 150 seconds to reverse the direction of a train. Due to the train-movement rule, 
a train is expected to interact strongly with those trains that occupy the terminus at the same time. 
Trains that arrive at the terminus separated by large time-intervals are unlikely to interact strongly. 
Therefore, on average, adjacent genes on a chromosome are strongly coupled with respect to cost. 
Genes separated by large chromosomal distances are, on average, weakly coupled with respect to 
cost. These assumptions are borne out by the performance of the genetic algorithm, reported in 
the sequel. A discussion of the biases of a number of alternative schemata exchange operators is 
presented in Eshelman [1]. 



3. Results 

3.1 Overview of Test Regimen 

Two experiments were designed to test the genetic algorithm by generating two types of input data. 
A typical timetable for ten trains was chosen as the starting point. The arrival times of some or 
all of the trains were then perturbed to generate the problem sets. The cost function, g{A), was 
the sum of the delays to each train on departure from the terminus. The fitness function used was 
f{A) = g(A)+i • scenario the delays to the arrival times of the trains were all equal, in the 

second the delays were uniformly randomly distributed within a given range. 

3.2 Equal Delays 

The arrival time of each train of the ten was delayed by the same amount, in the range from 10 to 300 
seconds in steps of 10 seconds, yielding 29 sets of train delays. For each data-set the algorithm was 
run ten times using different, randomly generated, initial populations of chromosomes and different 
seeds for the random number generator utilised by the algorithm. The allowed computational time 
was seven seconds on a Silicon Graphics R8000 Indigo 2 workstation. 

The results of this experiment are presented in Figure 2, which shows the total delay experienced by 
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Scenario one: A graph of the total delay summed over ten trains as a function of the 
unifrom delay imposed on each of the ten trains. (Results over ten optimizations) 




Figure 2: Equal Delays 

all ten trains as a function of the the delay imposed uniformly on each of the trains. The upper line 
in Figure 2 is the total delay that the trains have on arrival at the terminus. The next three lines 
show the worst total delay on departure, the average delay on departure for ten separate trials, and 
the least delay on departure. 

The algorithm attempts to eliminate the total delay experienced by all ten trains before they leave 
the terminus. Comparing the worst delay on departure line with the the total delay that the trains 
have on arrival at the terminus shows that the algorithm performed very well in all cases. The 
average delay on departure line shows that the algorithm performed excellently on average. 

3.3 Random Delays 

The arrival time of each the ten trains were delayed by a random amount, chosen within one of a 
number of given ranges. The upper bounds of the ranges spanned from 10 to 300 seconds in steps 
of 10 seconds, yielding 29 sets of train delays. This procedure was repeated four times to generate a 
total of 116 problem sets. For each data-set of each problem-set the algorithm was run ten times using 
different, randomly generated, initial populations of chromosomes and different seeds for the random 
number generator utilised by the algorithm. The allowed computational time was seven seconds on a 
Silicon Graphics R8000 Indigo 2 workstation. Four different randomly generated problem-sets were 
investigated in order to give a representative assessment of the performance of the algorithm. 

A summary of the results is presented in Figure 3, which shows three performance measures averaged 
over each of the four problem sets. The top (diamond) line is the total delay that the trains have on 
arrival at the terminus averaged over the four problem sets. The next line (circles) shows the average 
of worst total delay on departure found for the four problem sets. The third line down (triangles) is 
the average total delay on departure for all the optimisations made on the four problem sets. The 
fourth line down (squares) is the average of the best total delay on departure found for the four 
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Scenario two: Total delay on departure as a function the maximum 
possible value of random delay imposed on arrival. Results averaged over 
four problem sets, each the subject of ten distinct optimisations 




Figure 3: Random Delays 



problem sets. 

The algorithm was always able to reduce the delays on arrival that the trains had incurred. The 
average performance and best results show that the performance of the algorithm was excellent. 

4. Conclusions 

An algorithm based on a train-movement strategy and a genetic algorithm has been developed for the 
terminus-shunting problem. The performance of the algorithm has been assessed in two experiments 
based on the perturbation of a real schedule for a typical terminus, firstly for uniform delays, secondly 
for random delays. The results support the conclusion that the cost hyper-surfaces generated by the 
train-movement rule and the search regimen generated by the genetic algorithm are adequate to solve 
the train-shunting problem for a typical terminus within realistic time-constraints using currently- 
available hardware. The theory and complexity of the genetic algorithm was outlined in light of the 
terminus train-shunting problem. The nature of the algorithm and parameterisation chosen allows 
for near real-time continuous optimisation under a policy of genetic continuity between consecutive 
program executions. 

5. Further Work 

5.1 Continuous Adaptive Optimisation 

Partially or fully optimised route assignments may be retained between program executions, so that 
the population of chromosomes continuously adapts to the changing rail network situation. 
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Consider the case where no fresh delays have been incurred between program executions, and the 
algorithm optimises the route assignments for the next ten trains. In such a case the cost surface 
remains unchanged between program executions. The algorithm may then continue to attempt 
to develop a superior route assignment configuration for the ten trains under the unchanging cost 
surface. 

When it becomes appropriate to fix the route assignment of the nearest train to the terminus, the 
genes corresponding to this train are removed from the population. New genes are then added to the 
rear ends of the chromosomes, which are mapped to the eleventh train approaching the terminus. 
Assuming that no additional delays are incurred, such that the cost surface does not change, the 
algorithm will have a number of program executions to find the optimal route assignment for the 
eleventh train. Under this policy of genetic continuity, the effective computational time available to 
algorithm is greatly increased in situations where no additional delays are incurred between program 
executions. 

In the case where an additional delay is incurred by a train between program executions, it is 
reasonable to expect that the population found during the previously program execution will remain 
of a high quality. Consider a general building block B, i.e. a short schema with a high value of 
fitness. Building blocks adjacent to one another on a chromosome are coupled with respect to cost. 
We therefore observe that the building blocks placed around B constitute a cost environment. The 
schemata difference equation indicates that B will be retained within the population if it couples well 
to different building blocks within a large number of different cost environments. Building blocks 
which are not robust under different cost environments are removed from a population by a genetic 
algorithm. This property of optimising the robustness of building blocks implies that, on average, 
a population retained between program executions will be superior in quality to one generated at 
random. 

The policy of genetic continuity would not be expected to perform well if the cost surface changes 
drastically between program executions. The change in the cost surface between program executions 
should be minimised by choosing a small time gap between program executions. 

5.2 Additional Constraints 

Consideration must be given to the wear of track switches within the terminus in the shunting 
sequence proposed by the algorithm. The use of Route One defined in Figure 1 should be minimised 
in order to avoid wear of a crucial track switch. In addition, the use of this route at peak times may 
cause difficulties with the flow of passengers leaving the terminus. 

Due to its brevity, Route One of Figure 1 may be favoured by the algorithm when dampening 
perturbations. The brevity of Route One is counter-balanced by the fact that it occupies, for an 
extended amount of time, one of the two exit paths from the terminus, so that it is not always 
associated with low costs. In order to reduce its use by the algorithm an additional penalty cost may 
be associated with each occurrence of Route One within a chromosome. 

The results for the algorithm during the first test experiment indicate that an increase in the exploita- 
tive aspect of the algorithm’s investigation during small perturbations would improve performance. 
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The exploitative and exploratory characteristics of the genetic algorithm may be tuned to give better 
performance by changing the population size, the crossover rate of the crossover operator and the 
mutation rate. The fitness surface experienced by the reproduction operator may also be manipulated 
to change the balance of the exploitative and exploratory characteristics of the algorithm. 

An extensive investigation of the effects of such control parameters on the performance of a genetic 
algorithm is given in Schaffer [5]. It is possible to set a genetic algorithm the task of optimising the 
control parameters of another genetic algorithm. Such an optimisation task would, however, require 
a prohibitive amount of computational resources. 
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Pricing and Allocation of Rail Track Capacity 

A. Bassanini, University of San Marino 
A. Nastasi, University of Rome “Tor Vergata” 

Summary: The application of the European Community directives regarding the rail sector in- 

volves the separation between network management and transport service provision and calls for the 
definition of a track access mechanism. 

We propose a game-theoretic model where the transport operators (TOs) request tracks to the 
infrastructure company (IC) and set prices for the transport services they offer with the aim of 
maximizing their profit, considering demand data obtained by aggregating individual customers’ 
preferences. 

The IC allocates the tracks among the TOs and determines tariffs for infrastructure use through an 
allocation and pricing mechanism that keeps into account the competitive nature of the emerging 
organizational structure of the rail industry. 

The mechanism is inherently non-discriminatory, since the IC maximizes the value of the schedule 
of each TO on the basis of the requested tracks and the associated weights. 

Finally, the model enables the IC to choose among different solutions according to overall policy 
criteria. 



1. Introduction 

Rail transport policy has recently been subject to substantial changes, involving a tendency towards 
market liberalization and the transfer of relevant parts of the transport service from the public to 
the private sector (Bowers, 1996). 

In the European rail market this general move has been brought about by EEC directive 91/440, that 
imposes the separation of infrastructure property and management from transport service provision 
and ensures open access to the network to all operators who wish to provide such services. The 
infrastructure should be managed and operated by an infrastructure company. 

From the infrastructure company’s point of view, the new structure of the rail sector calls the 
attention to the problem of defining a track access mechanism. This problem consists of two main 
issues, namely, how to allocate the tracks among the operators who will request them, and, second, 
what pricing scheme should be adopted for the access to the tracks themselves (track usage rights). 
The track access mechanism should also respect the principles of non-discrimination and impartiality 
sanctioned by 91/440 and the complementary 95/18 and 95/19. 

The transport operators, on the other hand, have the freedom to choose their ideal train schedules 
to be requested to the infrastructure company and the prices for the services they provide. These 
decisions are of crucial importance for a rail operator who wishes to maximize its profit. The 
schedule determines a large fraction of an operator’s cost as well as the services it can offer. It also 
determines the quality of such services in terms of departure times and travel duration; quality of 
service and prices, in turn, affect the operator’s ability to attract customers (Dobson and Lederer, 
1993). Schedule decisions should therefore be made keeping into proper account overall demand data. 
The latter are derived by aggregating individual customers’ preferences. Following renowned models 
of consumer choice for transit services by Ben Akiva arid Lerman (1985) and Dobson and Lederer 
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(1993), we assume a random utility model such that each individual customer’s demand is given by 
a logit function. A customer’s demand for a train is a function of the total cost of using that train. 
We assume that a customer’s cost has three components: 1) the cost of departing at a time that 
differs from the customer’s most preferred departure time, or, alternately, the cost of arriving at a 
time that differs from the most preferred arrival time, 2) the cost associated with travel duration 
and 3) the actual fare. The total demand for each train is the sum of all customers’ demand for the 
train. 

The above-mentioned issues underline the complexity of the relations among the various actors 
involved in the new rail scenario. 

In the new context the traditional models for timetable formation are of little practical use, as the 
existing literature in this area actually assumes a single decision-maker, which is a typical character- 
istic of the monolithic and monopolistic structure of the traditional railway organization (e.g. Carey, 
1994; Odijk, 1996). 

On the contrary, the analysis of these novel structural relations requires considering a peculiar form 
of oligopolistic interdependence due to the characteristics of the technical nature of rail transport. 
In particular, the impossibility to assign the same rail track to two or more trains at the same time 
(as opposed, for example, to the analogous route assignment problem for the air traffic case), as well 
as the strong congestion-originated interdependence among different tracks interacting in some way 
with each other (e.g. having a station or line segment in common), make it impossible to determine 
the set of feasible schedules of a rail transport operator independently from the others’ timetable. 
On the other hand, the literature regarding congestion tolls estimation (e.g. Levin, 1981) and peak- 
load pricing (e.g. Crew and Kleindorfer, 1987), either ignores the problems imperfect competitive 
nature or treats spatial aspects separately from the temporal ones. 

This paper makes an attempt at jointly considering spatial and temporal aspects in a context char- 
acterized by oligopolistic interdependence, by adopting a game-theoretic approach. 

In our market-based model, the competing transport operators (TOs) make their track requests 
to the infrastructure company (IC) based on the available demand data and fix the prices for the 
services they provide, with the aim of maximizing their profit, based on: a) their actual schedules, 
and b) the tariffs they have to pay for infrastructure use. Elements a) and b) are computed by the 
IC through a capacity allocation and pricing mechanism, which is designed in order to respect the 
principles of impartiality and non-discrimination sanctioned by the EU directives. 

2. The model 

The competition among the TOs and the interactions between the latter and the IC can be described 
by a three-stage model. 

In the first stage, each TO submits to the IC his track requests in terms of the desired departure and 
arrival times in each station of the trains he wants to run. Each TO behaves in a non-cooperative 
manner and would like to obtain effective schedules as close as possible to the ideal ones. Together 
with the requested timetable, each TO submits to the IC a set of parameters, which represent the 
value (in monetary terms) he attributes to the deviation of one unit of time from each of the requested 
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schedules. 

In the second stage the IC determines the effective timetable and the related tariffs for all the TOs, 
on the basis of the latter’s requests and of values attributed to the tracks. In particular, the global 
timetable is determined by minimizing the deviation from the requested schedules for each transport 
operator, keeping into account the network’s capacity constraints. It follows that congestion plays a 
crucial role in the determination of the infrastructure user fees. 

In the third stage the TOs set the prices of the services for the customers, on the basis of the 
timetable and the tariffs computed by the IC. Each TO maximizes its profit by adjusting prices. 
Therefore, profit is maximized with respect to prices while considering consumer choice; to find 
profit maximizing prices, demand for trains as a function of prices must be determined. Consumer 
choice must therefore be analyzed in order to solve the price optimization problem. 

The three-stage sequential structure of the game imposes that its solution be derived through a 
procedure similar to backward induction, from the third stage up to the first two. In other words, 
the relevant equilibrium notion is the subgame perfect equilibrium (in the sense of Selten, 1975). 
Therefore, the TOs initially have to determine the price equilibrium configurations associated with 
the generic vector of effective timetables and tariffs computed by the IC. Subsequently, the timetables 
to be submitted to the IC and the values attributed to the deviation from the requested timetable 
are obtained on the basis of the equilibrium prices (and profits) related to the combinations of the 
alternatives available to the TOs. The relation between requested timetables and values attributed 
to the deviation on one side, and effective timetable and tariffs on the other is determined by the 
allocation and pricing mechanism implemented by the IC in the second stage. 

3. Notation 

Let us define the following notation: K\ set of all TOs; k\ TO k £ K\ K\k: set of all TOs, excluding 
k\ tk'. train of TO A;; T^: set of trains of TO k\ Tk\tk'. set of trains of TO k, excluding tk\ TK\Tk‘. 
set of trains of all TOs, excluding A;; /: set of principal stations, or yards; Ik', set of all yards in the 
schedule of train tk', i'. a departure yard, i e I; j: an arrival yard, j G /; lAjt^: requested arrival 
time at yard j for train tk of TO k\ IDa^ : requested departure time from yard i for train tk of TO 
A:; Ajti^: effective arrival time at yard j of train tki Dn^\ effective departure time from yard i of train 
tk'. ,....)^; A, =(...., At. ,....)^; AkHK I tk € Tkf'. Dk={Dt, \ tk G Tkf', A={....,Ak 

,....); D=(....,A ,••••); ^k={Ak. A)^; x={.....Xk ,.•••); effective arrival time at yard j of the 

set of trains Tk\tk'. DiT^\tj^: effective departure time from yard i of the set of trains Tk\tk'. 
effective arrival time at yard j of the set of trains TK\Tk] effective departure time from yard 

i of the set of trains TK\Tk] Wt^: monetary value attributed by TO k to the deviation from the 
requested schedule for train tk] DWa^ : minimum dwelling time of train tk at yard i] D[\ consumer’s 
ideal departure time from yard i\ demand (e.g. number of passengers) for train tk] ^k'. demand 
for TO k {^k = EtfcGTjfc tariff that TO k must pay for operating train tk] ™t price for 

the service provided network’s capacity constraints. It follows that congestion plays a crucial role in 
the determination of the infrastructure user fees. 

In the third stage the TOs set the prices of the services for the customers, on the basis of the timetable 
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and the tariffs computed by the IC. Each TO maximizes its profit by adjusting prices. Therefore, 
profit is maximized with respect to prices while considering consumer choice; to find ng due to our 
assumptions), consider transport service between the cities as provided by train tk run by transport 
operator A;; the operator may not be the only TO providing transport between the cities. 

Customers have preferences for trains serving the same OD pair that are a function of the most 
desired departure time, travel duration and service price. 

Note that train tk traveling from o to d may also serve customers demand for any 0-D pair (z, j) 
on the same rail line, provided that i coincides with or is east of o and j coincides or is west of d. 
This issue should be considered when estimating transport demand for train tk] therefore, we will 
first derive the expression for transport demand between the generic 0-D pair (z, j) served by tk and 
then evaluate demand between o and d - always for train tk. 

For any z, j and D[e [0,24) let denote the density of customers whose most preferred 

departure time from z to j is at time D[. It is assumed that this function is piecewise constant 
on [0,24). That is, there are G=G(z, j) constant density intervals over [0, 24); let /3g{i,j) be the 
density of the pth interval, p=0,...,G-l for travel between z and j. Assume for each origin-destination 
pair that customers travel preferences and densities are identical for each day. If this assumption is 
violated, one can redefine the period of analysis so that densities are identical for each period. 
Assume that a customer’s utility function is linear and separable in various attributes of routes. 
Assume money is the numeraire. If a customer whose ideal departure time from z is D[ instead 
leaves at time A, the customer’s utility declines by w \ D{ — D\ \ dollars, for some u; > 0. The 
term | A — DJ | is interpreted to be the difference in hours on a 24-hour clock between Di and D[. 
Customers also suffer a reduction in utility as a function of travel duration, in the amount of v dollars 
per minute for the duration of the trip. Finally, customers paying /z dollars for traveling have their 
utility reduced by fi dollars. 

Assume that, for origin-destination pair (z, j), demand for train tk by customers having ideal de- 
parture time DJ is given by a logit function (Dobson and Lederer, 1993). Then, if D[ is in the ^th 
interval of /?, demand for train tk for customers with ideal departure time D[ is: 

* / wx ^ 0gexp[-a(w I £>; - Du, \ +vAjt, +PtJ . . 

i) exp[-a(w \ D[ - At*, | +vAj,^, + PtJ ^ ’ 

where Tk{i,j) is the set of trains of TO k that serve 0-D pair (z, j). Demand for train tk going from 
0 to d can therefore be written as: 

^hXoj)(D'o)+ 53 ^«t:(o+lj)(-^o+l) + -+ 53 

j=o+l,...,d j=o+2,...,d j=d-l,d 

( 2 ) 

Now, assume each train tk has capacity - e.g. the maximum number of passengers that can be 
carried. This capacity constraint must be satisfied at each yard, where, for example, passengers can 
get on or off the train. Therefore, for each train tk we have d-o constraints of the form: 

E < Mh 

j=o+l,...,d 



(3) 
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E [^t.;W)(i3') + < Me, (4) 

i=r+l,...,d 



E < Me, (5) 



The above set of constraints may be written in a compact form by defining a matrix St^ with stopping 
yards on rows and 0-D pairs on columns. The generic element of can be defined as follows: 






1 if j > / and i = o , / 
0 else 



( 6 ) 



Using relationships (2) and (6) it is possible to maximize firm profit with respect to train ticket 
prices. The profit of TO k can be written as: 



n.(p)= E\pt.^tk-htk]-Ck (7) 

where p is the vector of service prices. 

TO A:’s problem of maximizing profit with respect to prices vector p is: 



maxp Uk(p) 



( 8 ) 



subject to 



St^6 < Mt^e 

where (5 is a vector whose elements are the demand between each 0-D pair and e= (1,...,!)^ is a 
(d - o) X 1 (column) vector. 

The equilibrium prices vector pj is derived by solving the system of the best reply functions of the 
n TOs, associated to the n profit maximization problems. 

4. Track access and capacity pricing 

In the previous section, we illustrated how demand for a specific train is affected by the train’s 
departure time as well as its origin-destination travel time. The latter elements are determined 
by the IC by implementing an allocation and pricing mechanism (Harker and Hong, 1994) that 
computes: i) actual schedules for trains and ii) the corresponding tariffs to be paid for network use. 
The former are obtained by maximizing the value of the effective timetable (i.e. the set of all train 
schedules) for each TO. This means that the weighted deviation from each TOs requested timetable 
is minimized, while enforcing the respect of the capacity constraints of the network and keeping into 
due account the interactions among trains - either belonging to the same TO or to different ones. The 
weights are represented by the values the TOs attribute to the deviation from each of the requested 
tracks; these values are also used to derive the priority level assigned to the considered train in the 
construction of the effective timetable. 
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It should be noted that the mechanism is non-discriminatory, since the IC maximizes the value of 
the schedule of each individual TO according to its requests. 

For this reason, it is possible to analytically represent the allocation and pricing mechanism as a 
game whose set of players is composed of the n competing TOs. In formulating the problem, it is 
important to keep in mind that, as we have already pointed out, it is impossible to determine the set 
of feasible schedules of a transport operator independently from the others’ timetable. This means 
that the players’ strategy sets are nondisjoint. 

For this reason it is not possible to formulate the mechanism as a standard Nash game, as the latter 
does not allow for interactions among strategy sets; the adequate analytical structure, instead, is 
that of a generalized Nash game. It is worth noting that, although the mechanism is implemented 
solely by the IC, since the TO play no active role (they have no strategic variable to determine in 
this stage), its formal structure allows it to be represented as a generalized Nash game where the 
players are the TOs. 

The tariffs in our case are essentially congestion-based ones, in the sense that each of them expresses 
the congestion degree that the train to which it is applied imposes on the system as a whole. This 
pricing scheme is justified by considering that trains’ delay imposes a time cost that builds up to a 
large part of the total overall cost of the system. 

The mechanism also incorporates the elements and the hypotheses of a line model (Harker and 
Hong, 1990) in order to provide a representation of congestion in railway operations - in particular, 
to compute the delay a train undergoes as a result of its interactions with the other trains in the 
system. The expected running time of a train between yards i and j is computed as the sum of the 
trains free running time between the two yards and the expected delays due to casual interferences 
- meets and/or overtakes - with other trains traveling in the system. The priority levels in the delay 
function incorporate the value attributed to the deviation from the requested schedule for the trains 
and have a direct impact on the expected running time of each train; the greater the value, the higher 
the priority level attributed to the train. 

5. Problem formulation 

The model requires lAjt^, as input data, and generates values for Ajt,^, 

^jTk\tk^ ^iTk\tk^ ■^jTK\k’> ATjc\jfc- 

The expected running time E{i,j)t^ of a train tk 6 Tk between yards i and j, calculated by the line 
model, can be written as: 



j)tk ~ j)tk {^jtk 5 Atjfe j AjT^\tk > ^Tk\tk ) ^jtK\k ’ At^\*. ) 

where: {Ajt^, At*): effective arrival and departure times between yards i and j of a train tk] (^jT*\t*, 
DiTk\tk)' effective arrival and departure times between yards i and j of the set of trains Tk\tk] ^ 

effective arrival and departure times between yards i and j of the set of trains TK\Tk. 

Let us now begin to formulate the problem by considering that each train tk must satisfy two operative 
constraints, the first relative to yard dwelling time, the second regarding total travel time between 
two yards. 
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If a train tk arrives at yard j, it must wait for a time at least equal to DWjtk (which is an exogenous 
datum) before leaving at Djt^ . The constraint on the minimum dwelling time is: 

^Jtk ~ ^jtk ^ ^^jtk ^ hitk ^ Tk (9) 

Second, the arrival time of train tk at yard j must be greater than or equal to the departure time at 
yard i plus the expected running time from i to j: 



^jtk j)tki-^jtki ^itkJ -^jTk\tki ^iTk\tkJ -^jT^s^kf 3 ^ ^ Tk 

(10) 

E{i,j)tk is the most probable running time between yards i and j; therefore, the respect of condition 
(10) ensures the feasibility of the schedule of train tk- 

Constraint (10) explicitly represents why the strategy sets of the TOs are non disjoint; TO k should 
know the schedules of the trains of the others, TK\Tk, in order to determine his own set of feasible 
schedules, as the expected running time between yards i and j depends not only on his own timetable, 
but also on those of all the other operators; it is therefore necessary to turn to the generalized Nash 
equilibrium concept. Let us define: XK\k = ....,Xk~i,Xk+i^ The set of feasible schedules 

for TO k e K is then defined by the set: Qk{^K\k) = {^k = i^k^Dk)^ \ constraints (9) and (10) are 
satisfied}. The problem that has to be solved for each TO k can finally be written as: 



Min Uk{xk, XK\k) = X) X ~ + X ~ J^} (11) 

h^Tk jelk *€/fc 



subject to 



^k = {^k,DkY G ^k{^K\k) 



The first term of objective function (11) measures the weighted square of the difference between the 
requested and the effective arrival time, and the second term the weighted square of the difference 
between the requested and the effective departure time at each yard; schedule delay and advance are 
thus both penalized. It should also be noted that the overall timetable is therefore a function of the 
TOs requested schedules and associated values: 



x = x{x,w) (12) 

where x is the vector of all requested schedules and w is the vector of the values associated to the 
requested tracks. 



6. Problem solution and tariff derivation 

For the determination of the effective timetables and tariffs we exploit some results of variational 
and quasivariational inequality theory. 

In fact, in the case presented in this paper we have to make use of the generalized Nash equilibrium 
concept and the search for the optimal solution is based on the theory of quasivariational inequalities 
(QVIs). The quasivariational formulation for a generalized Nash game can be obtained from the 
summation of each individual player’s first order conditions for his optimization problem. In the case 
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we analyze, moreover, the solution set of the QVI equivalent to the generalized Nash game contains 
the solution of a related VI problem, and in a special case - which is also the case in this paper - every 
VI algorithm can be employed in order to calculate a solution to the model; thus, the existence of a 
QVI solution is established.In our case, it is therefore possible to find a VI whose solution gives the 
effective timetable for all the competing TOs. This is particularly expedient since solution algorithms 
for Vis are more numerous and efficient than those for QVIs. 

In any case, we should remark that the obtained equilibrium is only one of a set of equilibrium 
solutions that might exist for the QVI related to the original problem. 

Once the effective timetable has been obtained, the tariffs the IC should impose in order to ensure 
the respect of the capacity constraints can be derived from the dual variables of the constraints in 
the generalized Nash game formulation (Bassanini and Nastasi, 1997a). 

A train with a higher priority level (i.e. to whose deviation from the requested schedule the TO has 
attributed a higher value) would be granted better schedule adherence - and thus experience less 
delay - and will generally pay higher tariffs, unless the track demand on the infrastructure portion 
it exploits is low. 

7. Schedule requests and optimality issues 

In this section we intend to analyze some aspects of the model with respect to its solution procedure 
and final outcome - which should be the subgame perfect equilibrium. 

First of all, consider that in the first stage the TOs have to decide which timetables are to be 
requested to the IC and what values should be associated to each requested track by solving the n 
profit maximization problems: 



max Uk{x{x,w)) VA; = l,...,n 

where the profit of each TO can be written as a function of the requested timetable and the associated 
values by considering the equibrium prices and equation (12). 

Solving these problems would allow us to determine the optimal timetables and values (x*,w*) to 
be requested to the IC. The IC would consequently determine (in the second stage) the effective 
timetables x* = x{x*jW*) and the tariffs and the TOs would choose the equilibrium prices vector 
p* = pk(x*) in the third stage. 

However, it is worth noting that it is possible to determine the subgame perfect equilibrium only 
when in the first stage of the game the number of choice alternatives (in terms of timetable requests 
and associated values) for each TO is restricted. 

As an example, consider an instance in which we limit our attention to a portion of the network 
and each of the n TOs is allowed to submit only a limited number - say, z - of timetable requests 
and related values. In this case, it is possible to enumerate all the combinations of timetables 
and values the TOs can submit to the IC and apply to each the allocation and pricing mechanism 
in the second stage. Subsequently, in the third stage, there are z^ price subgames - one for each 
of the effective timetables and tariffs obtained from the application of the mechanism to the above- 
mentioned combinations. 
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Thus, if a sole subgame perfect equilibrium exists, the outcome of the game is univocally determined. 
In the first stage, the TOs request the Nash optimal timetables and values; in the second stage, the 
IC implements the generalized Nash equilibrium concept and the search for the optimal solution is 
based on the theory of quasivariational inequalities (QVIs). The quasivariational formulation for a 
generalized Nash game can be obtained from the summation of each individual player’s first order 
conditions for his optimization problem. In the case we analyze, moreover, the solution set of the 
QVI equivalent to the generalized Nash game contains the solution of a related VI problem, and in 
a special case - which is also the case in this paper - every VI algorithm can be employed in order 
to calculate a solution to the model; thus, the existence of a QVI solution is established. In our case, 
it is therefore possible to find a VI whose solution gives the effective timetable for all the competing 
TOs. This is particularly expedient since solution algorithms for Vis are more numerous and efficient 
than those for QVIs. 

In any case, we should remark that the obtained equilibrium is only one of a set of equilibrium 
solutions that might exist for the QVI related to the original problem. 

Once the effective timetable has been obtained, the tariffs the IC should impose in order to ensure 
the respect of the capacity constraints can be derived from the dual variables of the constraints in 
the generalized Nash game formulation (Bassanini and Nastasi, 1997a). 

A train with a higher priority level (i.e. to whose deviation from the requested schedule the TO has 
attributed a higher value) would be granted better schedule adherence - and thus experience less 
delay - and will generally pay higher tariffs, unless the track demand on the infrastructure portion 
it exploits is low. 

8. Schedule requests and optimality issues 

In this section we intend to analyze some aspects of the model with respect to its solution procedure 
and final outcome - which should be the subgame perfect equilibrium. 

First of all, consider that in the first stage the TOs have to decide which timetables are to be 
requested to the IC and what values should be associated to each requested track by solving the n 
profit maximization problems: 



max Uk{x{x,w)) VA; = l,...,n 

where the profit of each TO can be written as a function of the requested timetable and the associated 
values by considering the equibrium prices and equation (12). 

Solving these problems would allow us to determined by aggregating individual customers’ prefer- 
ences. 

The IC ensures the respect of the capacity constraints of the network, allocates capacity among 
the TOs and determines tariffs for the use of the infrastructure through an allocation and pricing 
mechanism that keeps into account the competitive nature of the emerging organizational structure 
of the rail industry and ensures the respect of the capacity constraints of the network. 

This mechanism is non-discriminatory, since the IC tries to fulfill the desiderata of each TO compat- 
ibly with the requests of all the other operators. 
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The model is also flexible with respect to the market entry of new operators; if a new operator 
requests a track, the model can easily calculate the effects of this new entry, in terms of additional 
congestion, tariffs for the use of the infrastructure, service prices and profits, on the operators already 
present in the system. 
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Summary: The storage system ROTASTORE, produced by psb GmbH, Pirmasens, Germany, 

allows parallel operation of several layers of carousels. In this paper we will first assess the expected 
performance by means of stochastic computations. In practice such systems are often equipped with 
limited capabilities to reorder incoming goods to achieve more desirable storage allocation. We will 
present a combinatorial sequencing problem, which may be efficiently solved in an online situation. 
Also numerical results based on real-world data and simulation will be discussed. 

1. Introduction 

Modern manufacturing and distribution systems require the use of high performance, short term 
storage. Conventional stacker crane solutions providing the necessary output frequency are often not 
profitable. 

The storage system Rotastore, produced by psb GmbH, Pirmasens, Germany, overcomes these lim- 
itations by implementing a modular multi-carousel principle. This principle allows parallel loading 
and unloading by means of elevators interfacing the vertically stacked carousel layers. 

As Figure 1 shows, a typical configuration consists of two elevators and a stack of carousels with 
identical numbers of slots each. Identically sized trays are transported to and from the lifts by 
conveyors. Each carousel and each elevator may move independently. As indicated by the arrows, 
two combs of punchs simultaneously push trays on all layers in or out of the carousel stack. There 
are configuration variants in which the punchs may be operated individually or more than one set 
of elevators is present or even a single elevator performs both input and output (see [3, 2]). The 
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Figure 1: Typical Rotastore configuration with elevators 



whole assembly is controlled by a sophisticated high-level programmable computer system which 
keeps track of operations and inventory. 



2. Stochastic Measures 

In this section we will try to assess the expected performance of Rotastores with m slots on each of 
n layers. 

2.1 The I/O-cycle 

The system normally operates output-driven, i.e. an I/O-cycle starts with accepting a set of orders 
with one ordered tray on each layer. The carousels then place by rotating the ordered trays in front 
of the output elevator (“seeking”). Since rotation may take place in both directions, never more than 
half a revolution is necessary. Next the punchs push out all ordered trays onto the output elevator 
which immediately starts unloading them onto the leaving conveyer (“stepping”). The carousels 
move the now empty slots in front of the input elevator and the punchs push in the waiting trays. 




Figure 2: Three I/O-cycles of a Rotastore with independent elevators 
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While the elevators load and unload the next I/O-cycle starts with seeking the next set of orders. This 
parallelized action lends the Rotastore a competitive edge compared to conventional stacker crane 
high-bay storage systems. The Gannt diagram of Figure 2 illustrates three consecutive I/O-cycles. 

2.2 Expected Duration of an I/O-cycle 

For the performance of the system the number of I/O-cycles per time unit of course is crucial, there- 
fore we will now try to compute the expected duration of an I/O-cycle under uniformity assumptions. 
A first approach on this problem was made in [5]. A correct result was derived by Klinger in [3]. She 
shows, that the non-uniformity of the carousels movement, which is due to acceleration can safely 
by neglected. Kart nig [2] extends the results to additional configurations which are all different from 
the one we are concerned with here. 

For the following analysis we assume that the outgoing conveyor is fast enough and never blocked. 
Further on there shall be always trays waiting for input. Both assumptions may be met by installing 
enough buffer space before and after the elevators. 

Theorem 1 Let the orders be independently uniformly distributed. Then the expected duration of an 
I/O-cycle of a Rotastore with n layers is 

fi _j_ 5^+1 

Lytjo — ^ 1 ^^^i^maxseeky^elev) ^^pnsh d" t shift (1) 

where tmaxseek the maximal time needed to seek a tray on one layer, tdev is the time the elevators 
need for stepping, = teUv — tpush — tshift is the part of the stepping time which happens in parallel 
to seeking, tp^sh t shift the times for pushing and shifting and finally b := min{j-^^, 1) is the 

ratio between stepping and seeking time. 

Proof: 

Prom Figure 2 it is apparent that the duration of an I/O-cylce consists of a varying part caused by the 
seeking and a constant part caused by pushing and shifting. Further more tmaxseek is the time one carousel 
needs for half a revolution, since all carousels can independently rotate in both directions. It should be 
noted, that 

^elev — ^lO “ 2tpush — tshift ^ niax (tmaxseek? teiev) 

such that tio is constant in the case t^j^^ > tmaxseek and that the theorem is correct for this case. 

Otherwise the seeking times Xi on each layer i are independently uniformly distributed over [0; tmaxseek]? 
since we assume constant rotation speed for the carousels. The expected delay between input and output 
then is 

F(max(A^i , . . . , tgjgy)) = 



r^maxseek 

/ 1 - P(max(Xi, tgiev) < t)dt 

Jo 

f “T- P(A’i < ( A . . . A X„ < < A 4ev < 
Jo 

r^maxseek 

tmaxseek “ / P{X. < t)^dt 

•'^elev 

r^maxseek / f 

tmaxseek ~ j ( T ) dt 

X^maxseek/ 
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n + 6^+^ 
n + 1 



^maxseek 



q.e.d. 

In [3, 2] the same result has been derived for a combined input/output elevator (paternoster). The 
aspect ratio b then simplifies to 6 = since the elevator starts stepping only after the second 

push of the cycle. 



2.3 Expected Number of Trays 

Throughput is trays per time and as long as there is an ordered tray on each layer, the expected 
throughput is as In practice, however, it often happens, that the positions of the ordered trays 
leave one or more layers unoccupied. As a remedy the Rotastore accepts more than n orders which 
are fulfilled n at a time. We generalize the assumption of one independent uniformly distributed order 
on each layer to tit independent uniformly distributed orders anywhere in the Rotastore, where nr 
may be bigger than n. 



Theorem 2 Let there be a set of tit independent uniformly distributed orders. Then the expected 
number of occupied layers is 






( 2 ) 



Proof: 

First, let us assume, all nr orders spread over at most i oi k layers with m slots each. Then there are 
at most ways of choosing the ut slots, multiply counting those where less than i layers 

are occupied. Correcting this by the principle of inclusion and exclusion, the number of ways to choose 
nr positions, at least one on each of k layers computes as 

Since there are ^ ^ possible choices of positions in total and ^ ^ ^ ways to choose k oi n layers to be 

occupied, the probability for k occupied layers is ( ^ )/(n 7 ’,m, fc) ( | , and thus the result follows. 

\kj \nr ) 

q.e.d. 

Direct computation of this number shows, that for practically relevant cases one can expect between 
50% and 80% of the layers to be occupied by orders. If we assume that the Rotastore actually outputs 
one uniformly distributed chosen tray of the ordered trays from each occupied layer and that the 
rotation times of the unoccupied layers can be neglected, the expected throughput can be estimated 

by 

-1 






EnQQQ 



= £’rio, 






EHqqq + 



^ max(tniaxseek5 ^elev) ^tpush ^shift 1 (3) 




3. Sequencing 
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The previous section and practical experience shows, that additional measures have to be taken into 
account to distribute trays evenly with respect to the layers and thus to realize the parallelizing 
speedup of Rotastore. 

We will consider the case of a distribution center of a mayor German department store company. 
In this application two Rotastores are fed with the goods incoming by truckload and automatically 
reassemble them into ten-tray batches destined for the individual departments in the branch stores. 
In this case the trays are already partitioned into sets of orders when input into the Rotastore and 
the Rotastore may output a complete set of orders as soon as all trays of the set of orders are present. 
To improve on the performance, the incoming trays are presorted such that each batch spreads over 
as many layers as possible. 

3.1 The Hardware 




Figure 3: Three stacking frames 



Conveyors transport the trays throughout the system. Reordering is done online by stacking frames, 
small robotic devices which lift one tray high enough from the conveyor to let an arbitrary number of 
trays pass and set the lifted tray down again. As Figure 3 illustrates, usually several stacking frames 
are placed in a row along the conveyer so a tray may overtake several preceding trays easily. 

The assembly is usually located directly in front of the Rotastore in the middle of sufficient length 
of accumulation conveyor (e.g. roller conveyer) to make up for the delay caused. 

3.2 Realizable Permutations 

Obviously only a subset of permutations of trays can be achieved by a stacker system. In what 
follows, if cr is a permutation of the trays, cr(i) denotes the position in the output sequence of tray z, 
i.e. the tray at position i in the input sequence. 

Theorem 3 Let there he o stacking frames. A permutation a of n trays is realizable, iff no tray 
overtakes more than o trays, i.e. a{i) > i — o. If n < o all n\ permutations may be realized, 
o!(o+l)”~° otherwise. 

Proof: 

If tray i overtakes preceding trays, all of them have to be lifted simultaneously to let the overtaking tray 
pass. Thus a{i) >i — o if a is realizable since o frames hold not more than o trays simultaneously. 
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trays: 



layers: 




Figure 4: Bipartite matching for o = 1 



To derive the opposite direction, consider first n < o. Then all trays may be lifted simultaneously 
and set down in an arbitrary sequence. If n > o let a be a realizable permutation of n trays and 
cTj = (n+1 z-fl • • • n) o a be the permutation derived firom a by inserting tray n -|- 1 at position i G 
{n + 1 — o, ...,n + l} into the output sequence. Further more consider the stacking frame assembly after 
i — l trays left. Since no more than o trays are yet to come all of them may be lifted and set down in arbitrary 
sequence, particularly realizing cTj. The claims follow by induction, since obviously all o + 1 permutations 
of type (Ti are unique. Furthermore a realizable permutation stays realizable if one tray is taken out, since 
thereby the number of stacking frames used may only decrease. 

q.e.d. 

From Theorem 3 an elegant representation of all realizable permutations may be obtained. Let 
B = {Vi,V 2 ,E) be a bipartite graph with vertex set Vi being the set of input positions, vertex set Vg 
being the set of output positions and the edge set E connecting each input sequence position i to all 
output sequence positions j with j > i — o. Then the set of realizable permutations is exactly the 
set of perfect matchings in B. Since the trays enter the Rotastore sequentially through the input 
elevator, by permuting the trays are actually assigned to layers of the Rotastore. Figure 4 shows B 
for o = 1 stacking frames. 

3.3 Optimal Sequences 

To determine an optimal permutation we assign costs to placing tray i onto layer j, e.g. Cij = 1, 
if there is already a tray of the same set of orders as tray i on layer j and Cij = 0 otherwise. Then 
the solution of the integer linear program is an optimal realizable permutation: 



min 




(4) 




ij 






= 1 Vj 


(5) 


i 


= 1 Vz 


(6) 


J 

Xij 


G {0, 1} 1 <n s.t. j >i - 0 


(7) 



It is well known that this kind of integer program is totally unimodular and may be solved by some 
version the simplex algorithm. Many efficient matching algorithms solve the problem in polynomial 
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time. The simplicity of the formulation (4)-(7) lends itself to extensions. Objective (4) minimizes 
average cost. In an application where Rotastore needs to output all trays of a set of orders consec- 
utively, performance depends on the number of I/O-cycles needed to completely output a set. For 
a given set of orders the number of I/O-cycles needed for output is the maximum number of trays 
of this set of orders found on a single layer. Let nr input trays be partitioned into p < n subsets of 
orders Oi, . . . , Op and Ckj be the number of trays of set k of orders on layer j. Then the following 
ILP minimizes the average number of I/O-cycles needed to output the orders: 



min 


k 




(8) 


s.t. (5), (6), (7) 








Ckl + ^ij 


VI 


Vl<A;<o, l</<n 


(9) 


ieOk,j=l mod n 









Note that in this formulation tit may be bigger than n and thus facilitates optimization over more 
than one I/O-cycle. Unfortunately this formulation is no longer totally unimodular and its complexity 
status is still under research. 

Effective computation of optimal or good solutions is essential since it will be performed in real-time. 
In an online situation the nth tray may be yet unknown when tray 1 reaches the stacking frame 
assembly. In general, data of a varying number of trays may be available. In particular part of the 
relevant information will only become known in the future. As pointed out in [1] a-posteriori analysis 
of the performance of the algorithms is complicated by the fact that the output decisions are based 
on the distribution of orders in the store and in turn influence the cost coefficients for the input 
sequence optimization. 

4. Numerical Results 

For the case of the distribution center mentioned at the beginning of Section 3. we implemented a 
simulation model to test the effectiveness of (4)-(7). The application features two Rotastores with 
eight layers of 113 slots each. The input of each of the Rotastores could be rearranged by two stacking 
frames. In the simulation we compared the greedy- type heuristic implemented by the vendor with 
IP (4)- (7) for a varying number of stacking frames on one workday worth of data. The performance 
measure is the average number of I/O-cycles needed to completely output a set of orders. 

Figure 5 shows this number as it varies over time. The bold bottommost line represents a lower 
bound, which is computed by spreading each set of orders over all layers. Since the average size of 
the sets of orders was ten and there were eight layers, the lower bound approaches 1.2. The diagram 
shows, that already with two stacking frames an improvement of 18% was achieved and a gap of only 
8.8%, which may be reduced to 2.5% with seven frames. Since in this application single I/O-cycles 
of eight trays each were optimized, Theorem 3 shows that additional frames will not improve the 
results any more. 
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Figure 5: Numerical results from simulation 

5. Conclusions and Future Work 

In this paper we demonstrated how the performance of a complex storage system may be significantly 
enhanced. This was achieved by employing an eflftcient algorithm of combinatorial optimization in 
an online fashion without further investments into hardware. 

In many practical applications, e.g. commissioning, the assignment of trays to sets of orders is 
unknown at the time of input and there are strict restrictions on the output sequence, e.g. overlapping 
two sets of orders may be forbidden. Since the sequence of the trays output on a single I/O-cycle 
depends on the layers which they are located on, the unaided system cannot guarantee the sequence 
restrictions without severe performance penalties. A stacking frame assembly on the leaving elevator 
alleviates this problem partially. This topic is currently under research. 

Further on, the operation of the stacking frames themselves poses an interesting challenge for com- 
binatorial optimization, since there is considerable freedom in choosing how to realize a given per- 
mutation. In simulation experiments Muller [4], showed that different allocation strategies result in 
throughput variations of the stacking frame system of up to 90%. The optimal operation strategy is 
still an open question. 
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Zusammenfassung: Die 291 Produktgruppen eines Schweizer Nahrungsmittelherstellers werden 

entweder in seinen Produktionsstatten hergestellt oder von unabhangigen Lieferanten zugekauft und 
an iiber 3 ’400 Kunden in der gesamten Schweiz von verschiedenen Depots mit dem firmeneigenen 
Fuhrpark verteilt. 

Ziele der vorgenommenen Studie waren die Bestimmung der Einsparungsmoglichkeiten durch die 
SchlieBung einzelner Depots sowie die Betrachtung eines “Outsourcing” fiir einen Teil der Fein- 
verteilung an potentielle “Depositare” . Letztere sind 82 Grofikunden, die gegebenenfalls die Fein- 
verteilung an einzelne Kundenregionen iibernehmen konnen und fiir diese Dienstleistung einen Ab- 
satzpreisabschlag erhalten. 

Mittels Vereinfachungs- und Schatzmethoden konnte obige Problemstellung auf ein einstufiges unka- 
pazitiertes Standortproblem reduziert werden. Von wesentlicher Bedeutung waren in diesem Zusam- 
menhang die Analyse und korrekte Aufbereitung verschiedener Einzelkomponenten der hoch ag- 
gregierten Kostenkoeffizienten sowie die Durchfiihrung einer Reihe von Sensitivitatsbetrachtungen. 



1 . Problemstellung 

Die Distributionsstruktur eines Unternehmens umfafit iiblicherweise die Ebenen der Produktions- 
und Verteilzentren sowie der Absatzregionen. Wahrend letztere mit einer konstanten oder variablen 
Nachfrage ihre Standorte im wesentlichen nicht verandern, handelt es sich bei den Standorten der 
Produktion und der Verteilung um firmenspezifische strategische Entscheide, die die operativen Be- 
dienungskosten der Kunden stark beeinflussen. In diesem Zusammenhang wird im folgenden eine 
Standortstudie vorgestellt, die fiir einen Schweizer Nahrungsmittelhersteller durchgefiihrt wurde. 
Die Ausgangssit nation bildeten die vier in der Schweiz verteilten Produktionszentren mit ihren jew- 
eiligen Depots. Die Produkte der 291 Produktgruppen werden entweder in den eigenen Produktion- 
sstatten hergestellt und in den anliegenden Depots nach dem Vortransport (Transport auf der ersten 
Stufe zwischen Produktionsstatten und Depots) zwischengelagert oder direkt durch Drittlieferanten 
den Depots zur Verfiigung gestellt. Innerhalb der Feinverteilung (Transport auf der zweiten Stufe 
zwischen Depots und Kunden) erfolgt schliefilich die Belieferung der Kunden mit den nachgefragten 
Produkten durch die Depots. 

Da weder die Option einer Schliefiung einzelner Produktionszentren noch eine Umverteilung der 
Produktion einzelner Produkte zwischen den Produktionsorten zur Diskussion stand, beschrankte 
sich die Aufgabe auf die Bestimmung der Standorte der Verteilzentren sowie der Kundenzuordnung. 
Als potentielle Verteilzentren galten in diesem Fall die vier bestehenden Depots in unmittelbarer 
Nahe der Produktionsstatten sowie 82 “Depositare”. Bei letzteren handelt es sich um Grofikunden, 
die die Feinverteilung an kleinere Kunden gegen einen Absatzpreisabschlag iibernehmen wiirden. 
Geringere Margen wiirden in diesem Fall durch Einsparungen in den Distributionskosten seitens des 
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N ahrungsmit telherstellers kompensiert . 

Das Ziel der Kostenminimierung konnte somit unabhangig von den Produktionskosten verfolgt war- 
den und bezog sich, unter Beriicksichtigung der Fahrtzeitrestriktionen und Fahrzeugkapazitaten bei- 
der Transportstufen, auf die Vortransportkosten von den Produktionsstatten zu den Depots oder 
Depositaren, die variablen Kosten des Warenumschlags in den Depots, den infolge der Abgabe von 
Kunden an Deposit are entstehenden Umsatzverlust, die fixen Betreibungskosten der Depots und die 
Feinverteilungskosten. 

Nachfolgend wird naher auf die Datenaufbereitungs-, Modellierungs- und Parametrisierungsaspekte 
sowie auf die durchgefiihrten Analysen eingegangen. 

2 . Datenaufbereit ung 

Bei der Datenaufbereitung wurden die einzelnen Stationen des Warenflusses (Produktion - Depot - 
Kunde) und deren Verbindungen untersucht. 

2.1 Produkte 

Insgesamt warden 291 Produktgruppen entweder in einem der Produktionszentren hergestellt (und 
durchlaufen von dort aus den Vortransport) oder von externen Lieferanten direkt an die Depots 
Oder Depositare geliefert. Die Produktgruppen charakterisieren sich durch einen durchschnittlichen 
Absatzpreis und einen den allfalligen Depositaren zu gewahrenden Preisabschlag. 

2.2 Produktionszentren 

Die an einem Standort hergestellten Produkte warden im Vortransport mit grofien Fahrzeugen 
entsprechend der Nachfrage und der Kundenzuordnung zu den Depots oder Depositaren gebracht. 
Fiir die nicht durch den Nahrungsmittelhersteller selbst erzeugten Produkte wird dieser Vortransport 
durch Drittlieferanten iibernommen. 

Die zur Verfiigung stehenden Daten der Produktionszentren umfassen die Koordinaten des jeweiligen 
Ortes sowie einen “Produktverfiigbarkeitsindex” , der anzeigt, ob ein bestimmtes Produkt an einem 
Produktionsort verfiigbar ist oder nicht. 

2.3 Verteilzentren 

Als Verteilzentren wurden die vier bestehenden Depots und die 82 potentiellen Depositare beriick- 
sichtigt. Fiir die Depositare ist ein maximaler Belieferungsradius festgelegt, innerhalb dessen sie 
Kunden versorgen konnen. 

Die Depots beliefern die Kunden mit kleineren Fahrzeugen auf Touren, die sowohl durch die Fahrzeugka- 
pazitat als auch durch die Fahrtzeitrestriktionen bestimmt warden. 

Insgesamt setzen sich die Depot- und Deposit ardaten aus den Koordinaten der zugehorigen Orte, 
den Fixkosten eines Depots, den variablen Kosten des Umschlags pro kg im Depot sowie den beim 
Vortransport zu den Depots und Depositaren anfallenden Abladezeiten zusammen. 
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2.4 Kunden 

Die 3 ’400 Einzelkunden wurden dem euklidisch nachstgelegenen Strafiennetzwerkknoten zugeordnet 
und ihre Nachfrage in Kilogramm pro Produktgruppe entsprechend aggregiert. 

2.5 Fahrzeugdaten und Strafiennetzwerk 

Pro Transportstufe wurde ein bestimmtes vorgegebenes Standardfahrzeug und dessen Eigenschaften 
(variable Distanz- und Zeitkostensatze, Hochstgeschwindigkeit, Leistung, Masse und Kapazitat) be- 
riicksichtigt. Fiir diese beiden Fahrzeugtypen wurden die kostenoptimalen Wege sowie die daraus 
resultierenden Fahrtzeiten und -distanzen auf der Basis eines am Institut fiir Unternehmensforschung 
vorhandenen Schweizer Strafiennetzwerks mit rund 9’000 Kanten und 7’000 Knoten unter Beachtung 
unterschiedlicher Strafienklassen und deren Grundgeschwindigkeiten berechnet. 

3. Modell 

Obige Problemstellung entspricht einem zweistufigen kapazitierten Mehrproduktmodell, das mittels 
zusatzlicher Annahmen auf ein einstufiges unkapazitiertes Standort modell reduziert werden kann. 
Zur Beschreibung des Modells wird die folgende Notation verwendet: 

Indizes 

2 = 1,...,^ indiziert die Produktionsstatten, 
j = 1, n indiziert die Depotstandorte bzw. Depositare, 
k = , m indiziert die Kundenorte, 

/i = 1, . . . , p indiziert die Produkte bzw. Produktgruppen. 

Parameter 

aih bezeichnet das Angebot an Produkt h im Produktionsort 2 , 

Sj bezeichnet die Umschlagskapazitat des Depots j bzw. die maximale Abnahmemenge des De- 
positors jy 

bkh 1st die Nachfragemenge pro Woche des Kundenorts k an Produkt h, 

6ijh umfafit die Kosten des Transports einer Mengeneinheit des Produkts h von der Produktion- 
sstatte 2 zum Depot bzw. Depositor j, 

7 jtj umfafit die Kosten der Feinverteilung des Bedarfs des Kundenorts k ausgehend vom Depot j, 
pjk bezeichnet die variablen Kosten des Umschlags einer Mengeneinheit des Produkts h im Depot j, 
Vh bezeichnet den Verkaufspreis des Produkts h, 
ajh bezeichnet den Preisabschlag beim Depositor j fiir das Produkt /i, 
fj bezeichnet die fixen Betreibungskosten des Depots j pro Woche. 
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Variablen 



yj ist eine binare Variable, welche den Wert 1 annimmt, wenn das Depot bzw. der Depositar j 
gewahlt wild, 

Zkj ist eine binare Variable, die den Wert 1 annimmt, wenn der Kundenort k durch das Depot bzw. 
den Depositar j versorgt wird, 

Xijh bezeichnet die vom Produktionsort i zum Depot bzw. Depositar j zu transportierende Menge 
des Produkts h. 



Fafit man die variablen Depotkosten pro Mengeneinheit und den durch Abgabe von Kunden an einen 
Depositar entstehenden Umsatzverlust in der Grofie 



Vjh — 



{ 



Pjh 

Oijh • '^h 



, falls j ein Depot indiziert und 
, falls j einen Depositar indiziert. 



zusammen, so ergibt sich die folgende mathematische Darstellung des zweistufigen, kapazitierten 
St andort problems : 



q n p n m y p n 

min ^ ^ ^ OijhXijh + E Z) ( Jkj d- 'Hjh^kh j ^kj + X!/ 

i=lj=lh=l h=l ^ j=l 

unter den Nebenbedingungen 



n 





= 


1 VA:, 


j=i 

n 

^ijh 


< 


^ih ^ 


k=l \=1 ^ 

q m 

^ijh ~ y~^ ^kh^kj 


< 


SiVj Vj, 


_ 


0 Vj, /i, 


t=l k=l 

^ijh 


< 


^ihVj V i, j, h , 


^kj 


< 


Vj 


Xijh 


> 


0 


Zkj 


G 


o 

< 


Vj 


€ 


{0,1} Vj. 



( 1 ) 



( 2 ) 

(3) 

(4) 

(5) 

( 6 ) 

(7) 

( 8 ) 
(9) 

(10) 



Der erste Summand in der zu minimierenden Zielfunktion (1) gibt die Kosten des Vortransports 
aller Produkte wieder. Der zweite Summand umfafit die Feinverteilungskosten, die variablen De- 
potkosten bzw. die infolge der Abgabe von Kunden an Depositare entstehenden Umsatzverluste. 
Der dritte Summand liefert schliefilich die fixen Depotkosten einer Losung. Die Nebenbedingungen 
(2) bedeuten, dafi jeder Kundenort zu bedienen ist. Die Nebenbedingungen (3) bringen die Produk- 
tionskapazitatsbeschrankungen zum Ausdruck. Die Nebenbedingungen (4) beschreiben einerseits die 
beschrankte Kapazitat der Depots bzw. Depositare und bewirken andererseits, dafi kein Kundenort 
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von einem nicht gewahlten Depot bzw. Depositar bedient werden kann. Die Nebenbedingungen 
(5) bringen schliefilich zum Ausdruck, dafi die zu einem Depot (Depositar) gebrachten Mengen von 
diesem wieder an Kundenorte abzugeben sind. 

Unter der Annahme unbeschrankter Produktionskapazitaten erhalt jedes Depot j bzw. jeder De- 
positar j seinen kompletten Bedarf an Produkt h durch den dafiir giinstigsten Produktionsort ij, 
d.h. 



^ijjh — ^ 

Liegen ferner keine Kapazitatsbeschrankungen bei den Depots bzw. Depositaren vor, so reduziert 
sich obiges Grundmodell auf das nachfolgende unkapazitierte einstufige Standortproblem: 



min 



fjVj 



k=l j=l j=l 



( 12 ) 



unter den Nebenbedingungen 



= 1 
j=i 

^kj ~ Uj ^ 0 V fc, J , 

Zkj, Vj € {0,1} VA;, j. 



(13) 

(14) 

(15) 



Die Kostengrofie Ckj umfafit dabei samtliche auf die Bedienung des Kundenortes k durch das Depot j 
bzw. den Depositar j entfallenden Kosten (Vortransport, Depotumschlag bzw. entgangener Umsatz, 
Feinverteilung). Auf die genaue Zusammensetzung dieser Kostengrofie und die Ermittlung ihrer 
Komponenten wird im folgenden Abschnitt eingegangen. 



4. Parameter 

Die Kostengrofien Ckj im vereinfachten Modell umfassen die gesamten bei der Bedienung des Kunde- 
nortes k durch das Depot (den Depositar) j auftretenden Kosten und setzen sich zusammen aus 

1. den Kosten des Vortransports des Bedarfs Bk des Kunden k zum Depot bzw. Depositar j, 

2. den Kosten Ckj des Umschlags des wochentlichen Gesamt bedarfs Bk des Kunden k im Depot j 
bzw. den infolge der Abgabe dieses Kunden an den Depositar j entstehenden Umsatzverlust 
und 

3. den Kosten der Feinverteilung des Bedarfs Bk des Kunden k ausgehend vom jeweiligen 
Depot j. 

4.1 Vortransportkosten 

In die Ermittlung der Kosten die der Vortransport des Bedarfs Bk des Kundenortes k zum 
jeweiligen Depot bzw. Depositar j verursacht, fliefien eine Reihe unterschiedlicher Komponenten ein: 



die Distanz sowie Fahrtzeit Uj zwischen den Produktionsstatten i und den Depot- bzw. 
Depositarstandorten j, 
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• die Fahrtkosten des Vortransports pro km, 

• die Fahrt- und Standzeitkosten des Vortransports pro Minute, 

• eine fixe An- und Abfahrtszeit beim Vortransport, 

• die durchschnittliche Kapazitat der fiir den Vortransport eingesetzten Fahrzeuge und 

• der Produktverfiigbarkeitsindex Oj/i der verschiedenen Produkte h an den einzelnen Produk- 
tionsorten i. 



Bei einer Fahrzeugkapazitat in Hohe von konnen die Kosten 9ijh des Vortransports einer Men- 
geneinheit des Produkts h vom Produktionsort i zum Depot bzw. Depositor j abgeschatzt werden 
durch 



^ijh = 



{ 



(2 • dij -w^a + ^-Ur < + < • I or 

00 



, falls Uj/i > 0 , 
, andernfalls. 



Die an einem Produkt h benotigten Mengen bezieht das Depot j bzw. der Depositor j vom dafiir 
giinstigsten Produktionsort ij, welcher sich gemafi der Bestimmungsformel (11) ergibt. Fiir die 
Vortransportkosten ergibt sich somit 



^kj ~ ^kh * ^ijjh 
h=l 



(16) 



4.2 Umschlagskosten bzw. Kosten aus Umsatzverlust 



Die Umschlagskosten ergeben sich aus der wochentlichen Produktnachfrage bkh des Kundenortes 
k pro Woche sowie den Umschlagskosten pjh pro Mengeneinheit des Produkts h im Depot j gemafi 

= E P}h ■ hh • (17) 

h=l 

Die analogen Kosten in Form von Umsatzverlusten bei Abgabe des Kundenortes k an einen De- 
positor j ergeben sich aus der Produktnachfrage hkh^ den Verkaufspreisen Vh^ den zu gewahrenden 
Preisabschlagen der variablen Abladezeit und den Standzeitkosten w'l gemafi 

^Ij = E “jh • ■ bkh + (i""" • o ■ E . (18) 

h=l h=l 

4.3 Feinverteilungskosten 

Die Feinverteilungskosten c{j fallen nur an bei der Bedienung eines Kunden durch ein Depot. Zur 
Ermittlung dieser Kosten ist der auf einen Einzelkunden k' entfallende Anteil bzw. an der 
Fahrtzeit und -distanz einer Fahrzeugtour, die im Depot j startet und den Kunden k' umfafit, 
abzuschatzen. 

Die Gesamtdistanz Dk>j einer Tour, welche den Kunden k’ vom Depot j aus bedient und insgesamt 
rik'j Stops umfafit, kann dabei durch 



Dk'j — 2d/5j + 1) ' bk>j 
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abgeschatzt werden (siehe Fleischmann [1] sowie Klose k Stahly [3]). Dabei bezeichnet dkj die 
Distanz zwischen dem Depot j und dem Knoten k des Strafiennetzwerks, dem der Einzelkunde A;' 
zugeordnet ist. Diese Distanz dient zur Schatzung der An- bzw. Abfahrt distanz. Die Grofie 5k>j 
bezeichnet eine Abschatzung der ”Kunde-zu-Kunde-Distanz” auf einer Tour, die ermittelt wird als 
die mittlere Distanz zwischen den zum Einzelkunden k' gehorigen Netzwerkknoten k und den beiden 
nachstgelegenen Netzwerkknoten. 

Analog ergibt sich als Schatzung der gesamten Fahrtzeit Tk>j einer solchen Tour 

Tk>j = 2tkj + (rifc/j - 1) -Tk>j . 

Dabei ist tkj die Fahrtzeit zwischen dem Depot j und dem Knoten k des Strafiennetzwerks, dem 
der Einzelkunde k' zugeordnet ist. Diese Grofie dient zur Schatzung der An- bzw. Abfahrtzeit. Die 
Grofie Tk>j bezeichnet eine Abschatzung der “Kunde-zu-Kunde- Fahrtzeit” auf einer Tour. Sie wird 
ermittelt als die mittlere Fahrtzeit zwischen den zum Einzelkunden k' gehorigen Netzwerkknoten k 
und den beiden nachstgelegenen Netzwerkknoten. 

Die oben angesprochenen Distanz- und Fahrtzeitanteile d^ij und ergeben sich dann einfach durch 
Division mit der moglichen Anzahl Stops Uk>j auf einer solchen Tour, d. h. 

tf^tj = Tk'j/fik'j und dktj = I^k' j / j • 

Betragt die Abladezeit beim Einzelkunden k' somit Lk> Minuten, die Kosten pro Minute Fahrt- bzw. 
Standzeit in der Feinverteilung w{ und die Kilometerkosten in der Feinverteilung so erhalt man 
damit fiir die Feinverteilungskosten, die die Bedienung des Einzelkunden k' durch das Depot j pro 
Auslieferung verursacht, den Betrag 

Ck’j = • dl'j + w( w{ ■ Lk> . 

Sofern der Einzelkunde k' allerdings vom Depot j aus nicht in der maximal verfiigbaren Zeit 
erreichbar ist (d. h. 2 • tk>j-\-Lk> > T^^), so ist der Kostenwert Ck>j auf einen beliebig grofien Wert zu 
setzen. Die gesamten auf die Bedienung des Kundenortes (Netzwerkknotens) k durch das Depot j 
pro Woche entfallenden Feinverteilungskosten betragen somit 

~ XI ^k'j • 9k' J (19) 

k'£k 

wobei Qk' die Belieferungshaufigkeit des Einzelkunden k' pro Woche darstellt. 

Die einzige bisher nicht naher erklarte Grofie, die zur Bestimmung der Feinverteilungskosten benbtigt 
wird, ist die anzusetzende Anzahl Uk>j Stops der jeweiligen Tour. Sie lafit sich ermitteln als die 
maximale Anzahl Stops, die bei einer Fahrzeugkapazitat in der Feinverteilung, einer maximalen 
fiir eine Tour verfiigbaren Zeit in Hohe von sowie einer durchschnittlichen Ablademenge von hk> 
und Abladezeit von Lk> noch moglich ist, ohne dafi die Fahrzeugkapazitat bzw. die erlaubte Tourdauer 
iiberschritten wird. Die Anzahl an Stops kann somit als die grofite ganze Zahl bestimmt werden, 
welche den Bedingungen 
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und 

2tkj + {rik>j — 1) • Tjfe/j 4- rik^j • Lk> < 

noch geniigt. Die mittlere Ablademenge bk> bzw. Abladezeit Lk> wird dabei abgeschatzt mittels 
des Durchschnitts der Bedarfe pro Auslieferung bzw. der Abladezeiten aller dem Netzwerkknoten k 
zugeordneten Einzelkunden. 

5. Durchgefuhrte Analysen 

Die Losung des beschriebenen Modells erfolgte mit Hilfe eines auf Dual- Ascent- und Subgradien- 
tenverfahren basierenden Branch-and-Bound-Algorithmus (siehe Klose [2]), wobei verschiedene, fiir 
den Nahrungsmittelhersteller wichtige Parameterkonstellationen und Szenarien durchgerechnet sowie 
eine Reihe von Sensitivitatsanalysen durchgefiihrt wurden. 

5.1 Depots 

Eine erste Untersuchung wurde unter der Annahme durchgefiihrt, dafi nur die aktuellen vier Depots 
genutzt werden konnen. In diesem Fall ergab sich als kostenoptimale Variant e jene, in der auch alle 
vier Depots zum Einsatz kommen. Aufgrund der Losung wurde erkannt, dafi die auf die Strafien- 
netzwerkdistanzen bezogenen Kundenzuordnungen nicht immer den kiirzesten Luftliniendistanzen 
entsprechen. Ein weiterer gewichtiger Entscheidungsfaktor fiir die Zuordnungen lag im Vortrans- 
port. Einige Kunden wurden einem weiter entfernten Depot zugeordnet, falls dieses iiber die meisten 
vom Kunden nachgefragten Produkte (im Gegensatz zu seinem nachsten Depot) verfiigte. 

5.2 Depots und Depositare 

Unter zusatzlichem Einbezug der 82 potentiellen Depositare mit einem maximalen Belieferungsradius 
von 15 km wurde eine Kostensenkung von knapp 22% gegeniiber der vorherigen Losung erzielt. In 
diesem Fall wurden nur noch zwei Depots, dafiir aber 65 Depositare beriicksichtigt. 

5.3 Kostenkurve 

Die Kostenkurve zeigt die Gesamtkosten einer optimalen Losung in Abhangigkeit von der An- 
zahl auszuwahlender Depots und Depositare. Ihre Ermittlung erfolgte indirekt durch Einsatz der 
Technik der Lagrange- Relaxation und des Verfahrens der Tangentialapproximation, welches die 
Fixkostensatze systematisch variiert (siehe Klose und Stahly [3]). Bei einem maximalen Beliefer- 
ungsradius von 15 km fiir die Depositare ergab sich eine Mindestanzahl von zwei geoffneten Depots, 
um eine zulassige Losung realisieren zu konnen. Schon bei drei geforderten Verteilzentren wurde ein 
Depositor neben den zwei Depots beriicksichtigt, und ab 67 Depots/Depositare war keine Kosten- 
verbesserung mehr moglich. 

5.4 Lieferbereich 

Erhohungen des Belieferungsradius der Depositare auf bis zu 50 km hatten zur Folge, dafi immer 
weniger Depositare benotigt wurden, die Gesamtkosten weiterhin sanken und andere als die ur- 
spriinglichen Depots in die Losung einbezogen wurden. Der entsprechende Umsatzverlust infolge der 




530 



zu gewahrenden Preisabschlage wurde durch die Einsparungen von variablen Depotumschlags- und 
Fein vert eilungskosten iiberkompensiert . 

5.5 Kostensatze 

Sensitivitatsanalysen von Losungen beziiglich einer Veranderung der fixen oder der variablen Kosten 
eines einzelnen Depots bei Konstanz aller anderen Parameter beantworteten die Fragen 

• um welchen Betrag die fixen oder variablen Kosten eines in der optimalen Losung gewahlten 
Depots maximal steigen diirfen, ohne dafi die Wahl dieses Depots nicht mehr optimal ist, bzw. 

• um welchen Betrag die fixen oder variablen Kosten eines in der optimalen Losung geschlossenen 
Depots mindestens sinken miissen, damit die Wahl dieses Depots optimal wird. 

Zusatzlich wurde noch untersucht, um welchen Betrag jeder Fixkostensatz der vier Depots sich 
mindestens und maximal verandern mufi, damit eine optimale Losung entsteht, in der, unter Be- 
riicksichtigung der Depositare, genau 1, 2, 3 oder 4 Depots gewahlt sind. In diesen Fallen wurde 
wiederum von der Konstanz aller iibrigen Einflufifaktoren ausgegangen. 

6 . Schlufifolger ungen 

Eine aufwendige Datenaufbereitung bzgl. Produktions- und Verteilzentren, Kunden sowie Fahrzeu- 
gen fiihrte wahrend des durchgefiihrten Projektes zu einer guten Annaherung der Ausgangssituation 
eines Schweizer Nahrungsmittelherstellers. Zusammen mit einer vereinfachten Modellierung trug 
dieser erste Schritt entscheidend dazu bei, dafi die anschliefienden Losungen des einstufigen unka- 
pazitierten Standortmodells dem Auftraggeber plausibel erschienen. Dementsprechend wurden neue 
Kundenzuordnungen iibernommen und Verhandlungen mit einzelnen Depositaren in Gang gesetzt. 

Anmerkung: Dieses Projekt wurde von der Schweizerischen Kommission fiir Technologie und In- 

novation (KTI) unterstiitzt. 
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Summary: Let A be a nonempty finite subset of the plane representing the geographical coordinates 
of a set of demand points (towns, . . . ), to be served by a facility, whose location within a given region 
S is sought. 

Assuming that the unit cost for a € A if the facility is located at a; G S' is proportional to dist{x, a) 
— the distance from x to a — and that demand of point a is given by Ua, minimizing the total 
transportation cost TC{u^ x) amounts to solving the Weber problem. 

In practice, it may be the case, however, that the demand vector u is not known, and only an 
estimator u can be provided. Moreover the errors in such estimation process may be non- negligible. 
We propose a new model for this situation: select a threshold value jB > 0 representing the highest 
admissible transportation cost. Define the robustness p of a location x as the minimum increase in 
demand needed to become inadmissible, i.e. p(x) = min {Ho;* — c2;|| : TC{u*,x) > B, u* > 0} and 
find the x maximizing p to get the most robust location. 

1. Introduction 

In location planning one is typically concerned with finding a good location for one or several new 
facilities with respect to a given set of existing facilities (clients). The most common model in planar 
location theory for increasing the quality of the location of one new facility is the so-called Weber 
problem, where the average (weighted) distance of the new to the existing facilities is taken into 
account (see [6] [14] [11]). 

More precisely we are given a finite set A of existing facilities (represented by their geographical 
coordinates) and distances da assigned to each existing facility a e A. Additionally, weights Ua 
reflecting the relative importance of existing facility a 6 A are provided. 

With these definitions the objective function for the Weber problem can be written as 

TC(uj,x) :='£ujada{x) 

oeA 

which should be minimized over all x in the plane or over a nonempty closed subset 5 C IR^ for 
given weight set u = {uJa)aeA- 

When applying this model to real world problems, mainly two sets of parameters have to be deter- 
mined: 



1. What kind of distances da should be used in the model. 
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2. How can we determine the weights Ua. 

A lot of research for finding appropriate distance functions for applying the Weber problem to different 
geographical settings has been done in the last decades, starting with ([15]). Other contributions to 
this topic can be found in [9], [11], [2] and references therein. 

For the determination of the weights the situation is somehow different. The existing approaches can 
be divided roughly into three categories: 

1. All weights are assumed to be known and reliable (situation of complete information). 

2. All weights are again assumed to be known but a sensitivity analysis is performed in order to 
get information about the stability of the optimal solution with respect to small changes in the 
input data ([8]). 

3. All weights are assumed to be given with respect to a known distribution ([5], [11] and references 
therein). 

In practice, it may however be the case that the demand u is not known and no probabilistic 
distribution can be provided. Moreover, when replacing the demand u by an estimate u the errors 
made might be rather high and uncontrollable so that a sensitivity analysis would be of no help. 
Examples are the planning of unique and major events for which no knowledge of the demand exists, 
or the planning of installations which are supposed to serve potential clients over a long period of 
time for which the evaluation of demand is unknown. 

In this paper we propose a different strategy for handling the weight estimation problem. Select a 
threshold value J5 > 0 representing the highest admissible transportation cost or just the budget 
given and fix an estimation Cj with positive components. Now, define the robustness p of a location 
X as the minimum deviation in demand with respect to Cj for which the total cost for location x 
exceeds the budget. In other words: Given a norm || • || on the space of weights we have 

p(x) = inf {||a; — a>|| : TC{u)^x) > B, cj > 0} 

By solving then the optimization problem 

max pix) 

we get a most robust location x* . 

The rest of the paper is organized as follows: In the next section the model is introduced and 
some reformulations are discussed. Then a general solution technique is proposed and an illustrative 
example is given. The paper ends with some conclusions and an outlook to further research. 

2. A possible model 

For any feasible location x G 5 C IR^, its robustness p{x) is defined as the optimal value of the 
optimization problem 

inf ||cj — (j|| 
s.t. TC{(jj,x)>B 

CJ > 0 



(1) 
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where || • || is a norm in the space of weights such as 

II II l^o| 

m = max 

" " aeA ’ 

thus measuring the highest relative deviation, or 

ll«ll = max |««| 

measuring the highest absolute deviation, or 

ll«ll = Z! I«-I- 

aeA 

measuring the total absolute deviation, or 

ni = (e««)’ 

measuring the squared root of the sum of squares. 



( 2 ) 

( 3 ) 

( 4 ) 

( 5 ) 



The case in which A consists of exactly one point, A = {a}, is trivial: the total transportation cost 
TC{uj, a) from a equals 0, thus p{a) = +oo, and then a is the most robust solution. Throughout the 
paper we will exclude this trivial case and assume hereafter 

A has at least two points (Al) 

2.1 Some reformulations 

Under Assumption Al, TC{u,x) is strictly positive for any u with strictly positive components, 
which implies the following 

Proposition 2.1 For any x 6 IR^, the problem (1) is feasible. In particular, 

p{x) < +00 Vx € IR^ 

Moreover, p{x) can also be expressed as 

p{x) — min ||o; — c2;|| 

s.t TC(u,x)>B (6) 

a; > 0 

By Proposition 2.1, measuring the robustness of a given x amounts to solving the nonlinear opti- 
mization problem (6). We will show below that, under very mild conditions, the optimal value of (6) 
can be obtained explicitly. 

We first recall that a norm || • || in IR” is said to be absolute iff 

||(Ul,U2,...,Un)|| = ||(|Ui|,|U2|,...,|Un|)|| Vu € IR” 

In particular, weighted Ip norms, such as those given in (2)- (5) are absolute norms. For technical 
reasons we assume in the following that 



II • II is an absolute norm 



(A2) 
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Lemma 2.2 Let x such that TC{u,x) < B. Therij 

min {11a; — (2;11 : TC{uj, x) > B, u > 0} = min (Ho; — : TC(a;, x) = B} (7) 

Proof. For any G such that 

TC{uj\x) >B,u^> 0, (8) 

define A as 

TC{uj\x)-B 
~ TC{u^-u,x) 

It follows from the assumptions and (8) that 

TC(cj\x)>B>TC{uj,x), 

thus A € [0, 1). Defining as 

o;^ z= (1 — A)o;^ + Xu, 

it follows that 

TC{u^,x) = TC(u\x)-XTC{u^-u,x) 

= B. 

Hence, since A G [0, 1), 

\\u^-u\\ > (1- A)llo;^-c2;ll 

= 11(1 — A)o;^ + Ad> ~ (jH 
= 

> min (llo; - cjH : TC{u, x) = B} 

Hence, 

p{x) > min (llo; - (j1| : TC{u, x) = B} 

Conversely, given o;^ G such that TC{u, x) = B, define G as 

= max{a;^, 0} \/a e A 

Then, > 0 and 

TC(u\x) = 

aeA 

{o6>l:wi>0} {o€A:u;3<0} 

= Y 

{o6i4 :u;3>0} 

> 

a^A 

= TC(w®,x) 

= B 
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This implies that p{x) < \\u^ — i^ll- 
Moreover, since, 

l^a “ ^ l^a ~ ^o| Vtt € A, 

thus, since any absolute norm is a monotonic norm, [1], 

11^4 _ ^11 < 11^3 _ ^ 11 ^ 



and hence 



||a;^ — > min {||a; — a>|| : TC{uj, x) > u >0} 



Proposition 2.3 For any x eJRr, 



p{x) = max < 0, 



B-TC{u,x) 

\mx))aeA\\^ 



Proof If TC(u,x) > B^ then u is feasible for (6), thus p{x) = 0. If a: satisfies TC{u,x) < B, then, 
by Lemma 2.2, p{x) is the distance (according to metric 11*11) from point Cj G to the hyperplane 
{lj G IRI"^I : TC(uj,x) = B}, Hence, 



p{x) = 



B — TC{(jj^ x) 
\\{da{x))aeAr^ 



and then the result follows. 



From Propositions 2.1 and 2.3 one immediately obtains 
Proposition 2.4 Define Zmed = mmxesTC{u,x). 



1. If ZffiQd B , then 



p{x) = 0 Va: G 5 



In particular, any x £ S is a most robust location. 

2. If Zmed < B, then maxxes p{x) > 0. Moreover, a feasible point x* e S is a most robust location 
iff it solves the problem 

B-TC{u},x) 



Hence, for Zmed > B, the problem is trivial, and will not be considered in the following, by assuming 



^med ^ B 
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3. A general solution approach 



Denote by p the function 



B-TC{w,x) 



( 10 ) 



" ■ ||(d.(x)).s4ir 

By Proposition 2.4, solving ( 1 ) may be reduced to maximizing on S the nonlinear function p defined in 
(10). Function p has, however, a rich structure which enables its maximization by existing methods. 
In particular, we can use the approach of Dinkelbach (see [4]) to get the following iterative solution 
procedure for 

^ N{x) 
maxp(a;) =: : . 

x 65 ^ D{x) 



1 . Find an optimal solution x* for problem max^es N{x). 

2 . q := p[x-). 

3. Compute an optimal solution x' for 



maxA/'(x) - qD{x) ( 11 ) 

4. If N[x') — qD{x') = 0 then STOP: x' is an optimal solution to the fractional program. 

5. q := p{x'). Goto Step 3. 



In [ 12 ] it is shown that the Dinkelbach’s algorithm converges superlinearly and often (locally) quadrat- 
ically. 

Hence, in order to use Dinkelbach’s approach, at each iteration a problem of type (11) must be 
solved. It turns out that problems (11) are manageable at least for a wide class of distance measures. 
Indeed, one has 

Lemma 3.1 Suppose that, for each a £ A, da is induced by a norm in IR^. Then, any problem of 
type (11) to be solved in Step 3 of Dinkelbach 's algorithm has a concave objective. 

Proof. Since || • || is, by assumption, monotone, its dual || • ||® is also monotone [ 1 ]. Hence, the 
function x i-> ||(da(a;))aeAir convex, since it is the composition of the convex functions da with 
the monotonically increasing convex function || • ||®. Moreover, by Assumption A3, p{x*) > 0, 
and by construction of N and D, each q obtained in Step 5 is also positive, thus the function 
X g'||(da(a;))aeA||° is convex, from which the result follows. □ 



Hence, as soon as the feasible region 5 is a convex set, the optimization problem in Step 3 is a 
maximization of a concave function over a convex set (or equivalently a minimization of a convex 
function over a convex set) for which numerous algorithms exist (see, for example, [7]). 



Example 3.1 We are given 10 existing facilities ai = (0,0), U 2 = (1,0), = (1,-1), 04 = (0,1), 

05 = ( 0 , 2 ), 06 = (5,6), 07 = (6,3), Os = (8,4), 09 = (10,5) and Oio = (6,10). The estimator for 
the weights is u) = (4, 4, 4, 4, 4, 1 , 1 , 1 , 1 , 1) and the budget B is 100. The distance da(x) is measured 
by the Euclidean norm and also the deviation in the space of weights is measured by the Euclidean 
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norm. Note that the dual of the Euclidean norm is again the Euclidean norm. Therefore our objective 
function for finding the most robust location is now 

-/X 100-Ea€/t‘^a^2(x,g) 

||i2(x,a)„ex||2 ’ 

where || • H 2 denotes the Euclidean norm. Dinkelbach’s algorithm needs 3 iterations to find the most 
robust location x = (0.758401,0.652517) with objective value 1.8481. Note that in every iteration a 
nonlinear program has to be solved. The solutions to the single iterations are given in the following 
table 



Iteration 


X 


p{x) 


0 


(0.58947,0.46865) 


1.82525 


1 


(0.75621,0.65004) 


1.84809 


2 


(0.75840,0.652516) 


1.84810 


3 


(0.758401,0.652517) 


1.84810 



In Figure 1 the solutions of the different iterations are shown. 




Figure 1: Dlustration for Example 31. 



Anyway, Dinkelbach’s approach is not the only option to maximize p. We recall that a function / is 
said to be explicitly quasiconcave if both upper level sets and strict upper level sets are convex sets, 
see e.g. [10] for further details. 
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Lemma 3.2 Suppose that, for each a G A, da is induced by a norm in IR^. Then, p is explicitly 
quasiconcave. In particular, for S convex, any local maximum of p is also a global maximum on S. 

Proof. It has been shown in the proof of Lemma 3.1 that the function x i-> ||(da(x))aeyi||° is convex, 
and it is obviously positive. The result then follows from the algebra of convex functions (see [10]). 
□ 

Hence, any local-search procedure leads to global optimality. 

4. Conclusions 

In this paper we have addressed a planar single-facility location problem in which a high level of 
uncertainty is involved in the demand vector. 

The concept of robustness of a feasible solution x as a measure of the acceptance of x is introduced, 
and the most robust location is then sought. 

Finding the most robust location amounts to solving a nonlinear fractional problem, solvable by 
existing methods such as Dinkelbach’s algorithm when distances are induced by norms. More efficient 
procedures are possible when further assumptions (e.g. distances measured by the Manhattan norm) 
are made (see [3]). 

The concept of robustness could also be used in another usual location setting, namely, location on 
networks, leading again to nonlinear fractional programs which, under further assumptions on the 
norm || • ||, can be solved by inspecting a finite set of candidate points. 

Another interesting extension of this model is obtained if not only the robustness but also the actual 
transportation cost are taken into account via a biobjective problem, which again becomes piecewise 
linear and tractable under polyhedrality assumptions on || • ||. 

These extensions are currently under research. 
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Paret6-Optimal and Compromise Decisions in Problem with 

Fuzzy Criteria 



Jaroslav Ramik, Silesian University, Karvina, Czech Republic 

Summary: This paper deals with the problem of existence Pareto-optimal decisions, the existence 
of compromise decisions and conditions for a compromise decision to be Pareto-optimal. The concept 
of compromise decision is generalized by adopting triangular norms. 

1. Introduction 

Modeling many decision or, optimization problems we meet two groups of elements: alternatives and 
criteria. By the value /t(a), the effect of the alternative a from X measured by the z-th criterion on 
the scale IR^ is expressed, e.g. in money units, physical units as grams, meters, watts, etc. However, 
each measurement can be subjected to imprecision of the measuring devices, and also the measured 
magnitudes of a criterion have usually nonlinear significance for the decision maker (DM). This is, 
among others, the reason for using utility theory within the framework of multicriteria analysis. 
Here we adopt another approach by introducing fuzzy criteria on the scale IR^ Using the z-th fuzzy 
criterion, the significance (satisfaction) lies between 0 and 1, the membership value (pi{x) = 0 means 
the least significance of (or, satisfaction with) x = fi{a) for the DM. On the other hand, by (pi{x) = 1 
the highest possible significance of x is meant, i ^ I = {1,2, ...m}, here I is an index set for the 
criteria. The higher the membership value the better the value x. The optimization (or, decision) 
problem is to find an ’’optimal alternative”, or, ’’optimal decision” a* from X with respect to criteria 
H- = (p- o/j called fuzzy criteria. A fuzzy criterion is given by its membership function from IR” to 
unit interval [0, 1]. 

Within the framework of the above mentioned decision situation, in this paper we deal with the 
existence and mutual relationships of three kinds of ’’optimal decisions”: Weak Pareto-Maximizers, 
Pareto-Maximizers and Strong Pareto-Maximizers - particular alternatives satisfying some natural 
and rational conditions. Here, they are commonly called Pareto-optimal decisions. We study also 
the compromise decisions x* e X maximizing the min{/ii(x); z € /}, introduced by Bellman and 
Zadeh in [1], we call them max-min decisions. The set X considered here is a convex subset of 
n-dimensional Euclidean space IR”. The main subject of our interest is to derive conditions for a 
compromise decision to be Pareto-optimal. Moreover, we generalize the concept of the compromise 
decision by adopting triangular norms (so called t-norms) and extend the theorems for max-min 
decisions to the corresponding max-t decisions. We derive the results for n-dimensional Euclidean 
space IR” with n > 1, however, some results, can be derived only for IR^ = IR. 

2. Generalized Concavity, Fuzzy Criteria 

The following generalized concavity concepts are discussed in more details e.g. in [7]. We apply the 
concavity concepts to the membership functions of fuzzy sets given by membership functions /x : 
IR” [0, 1]. 
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Definition 2.1 An a-level set [/i]a, a € [0, 1], of a fuzzy set given by membership function // : 
IR^ — )> [0, 1] is defined as 

Mq = {x € ^l{x) > a}. (1) 

A support of a fuzzy set /i : IR” [0, 1] is defined as 

supp{p) = {x € TR'^]fi{x) > 0}. (2) 

Definition 2.2 A real function p : IR”— >[0, 1] is called quasiconcave if 

H(\x + (1 - X)y) > min{/i(a:), /z(y)}, (3) 

for every x, y € IR”, 0 < A < 1. 

Definition 2.3 A real function p : IR” — ^ [0, 1] is called strictly quasiconcave if 

fi{Xx + (1 - X)y) > mm{n{x),iM{y)}, (4) 

for every x, y 6 supp(/x), x ^ y and 0 < A < 1. 

Definition 2.4 A fuzzy set p, : JR^ [0, 1] is bounded if there exists a e [0, 1] such that [p]a is 
non-empty and bounded (in the usual sense). 

The following proposition is a special case of Theorem 3.1 in [7]. 

Proposition 2.1 A real function p : IR” -> [0, 1] is quasiconcave if and only if (iff) for every 
a e [0, 1], [p]a is a convex set 

The proof of the proposition is straightforward. Notice that (3) holds for every x, y € IR”, 0 < A < 1, 
whereas (40 < A < 1. 

Proposition 2.2 Let p : IR” [0, 1]. If p is strictly quasiconcave then p is quasiconcave. 

In what follows we shall deal with a special type of fuzzy sets given by membership functions p : 
IR'^ [0, 1], satisfying two properties: convexity and normality. 

Definition 2.5 A fuzzy set given by the membership function /x : IR” ->> [0, 1] is called a fuzzy 
criterion if 

(i) p is quasiconcave on IR” O' convexity property"), 

(a) there exists x € IR” with p{x) = 1. ('"normality property”). 

The following proposition gives the characterization of fuzzy criteria on IR\ the proof can be found 
in [8]. 

Proposition 2.3 Let p : IR [0, 1] be a fuzzy criterion. Then there exist a, p e [0, 1], a, b,c,d e 

JR U {+ 00 , — oo}, a < b < c < d, such that 

p(x) = a for X < a, 

p{x) is non- decreasing for a < x < b, 

p(x) = 1 for b < X < c, 

p{x) is non-increasing for c < x < d, 

p{x) = p for d < X. 
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3. Pareto-Optimal Decisions 

Now, we consider a decision problem with the set of fuzzy criteria given by membership functions 
pi : X [0,1], i £ I , where X C IR” is a convex set, n is an integer, n > 1, / = {1, 2, ...m} is an 
index set, m is an integer, m > 2 . We look for "best” decisions x* £ X, with respect to the above 
fuzzy criteria. 

Definition 3.1 (i) A decision xwp ^ X is said to be a Weak Pareto-Maximizer (WPM), if there 

is no X € X such that 

Pi{xwp) < for every i E I. (5) 

The set of all Weak Pareto-Maximizers in X is denoted by Xwp- 

(ii) A decision xp e X is said to be a Pareto- Maximizer (PM), if there is no x ^ X such that 

< fJ'iix), for every i e I, (6) 

and at least one of the above inequalities is strict. The set of all Pareto-Maximizers in X is 
denoted by Xp. 

(Hi) A decision xsp € X is said to be a Strong Pareto-Maximizer (SPM), if there is no x E X, 
X ^ xsp, such that 

fJ'iixsp) < for every i E I. (7) 

The set of all Strong Pareto-Maximizers in X is denoted by X^p. 

The Weak Pareto-Maximizers, Pareto-Maximizers, and Strong Pareto-Maximizers are called 
Pareto- Optimal Decisions. 

The following proposition follows directly from the definition. 

Proposition 3.1 Any SPM is PM, and any PM is WPM, i.e. 

Xsp C Xp C Xwp- (8) 

Example 3.1 To illustrate the above concepts of Pareto- Optimal Decisions, let p\ and p 2 be as in 
Figure 3.1, then Xwp = [ a,g], Xp = [6,c]U]d,e[U{/}, Xsp =]d,e[U{/}. 

The following example shows that in some simple situations Xsp may be empty. In the next section we 
find sufficient conditions such that Xsp contains just one element, which is important from practical 
point of view. 

Example 3.2 Let /xi, p 2 be as in Figure 3.2. Here, /xi, /X 2 are continuous and evidently Xwp = 
[a,d],Xp = [b,c], however, Xsp = 0. 

Proposition 3.2 Let pi , i £ I, be membership functions of strictly quasiconcave fuzzy criteria and 
let Xi ^ X = IR be such that /Xj(xj) = 1, x E /. Then Xwp = Xp = Xsp = co{xi] x E /}. 
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Fiqure 3.1 




Figure 3.2 



Proof. 

Set Xj = min{^i; i G /}, = max{^i; i 6 /},then 

co{xi\ie /} = [xj,Xk]. 

Let X G [xj,Xk] and suppose that x ^ Xsp. Then there exists y with iii{y) > /ii(a;), for all i G /. 
\iy < X, then by strict quasiconcavity of fik we get fik (y) < y>k (x) j a contradiction. On the other 
hand, if a: < y, then again by strict quasiconcavity of fij we get (y) < (x) , a contradiction. 

Hence co{xi\ i e 1} = [xj,Xk] C Xsp C Xwp- 

Suppose X ^ [xj,Xk], e.g. x < Xj. By strict monotonicity of fii, we get /Xj(x) < fii{xj), for all i G /, 
hence, x ^ Xwp- 

Assuming x > we obtain by analogy the same result. Then Xwp C [xj, xjk], the rest of the 
proposition follows from Proposition 3.1 and (8). □ 

4. Compromise Decisions 

In the theory of multi-objective optimization, the ’’compromise decision”, or, ’’compromise solution” 
is obtained as the solution of single-objective problem with the objective being a convex combination 
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of all criteria. 

In this section we investigate some properties of a commonly used compromise decision x* G X, 
maximizing the minimum of all criteria, i.e. min{/i(a ;);2 G /}, where X C IR^ is a convex set, 
I = {1, 2, ..., m}. The original idea belongs to Bellman and Zadeh in [l]in decision analysis. 

Definition 4.1 Let /ii, i e I, be the membership functions of fuzzy criteria. A decision x* ^ X is 
called a max-min decision, if 

min{ni(x*)\ i e 1} = max{min{/Xi(rr); i £ I}; x e X}. (9) 

The set of all max-min decisions in X is denoted by Xm> 

The following 2 propositions investigate the existence of max-min decisions. In IR^ the boundedness 
of the criteria is not necessary for existence of nonempty Xm, whereas the same result does not hold 
in IR"^ with n > 1, as is demonstrated on Example 4.2. 

Proposition 4.1 Let fXi, i £ I, be the membership functions of upper semicontinuous (USC) fuzzy 
criteria on IR. Then Xm 7^ 0- 

Proof 

Setting Xj = min{xi;i G I},Xk = max{xi]i £ /}, then co{xi\i € /} = [xj.Xk] . By Proposition 2.2 
, fii are non- decreasing in ] — oo,xj] and non-increasing in [x)b,H-oo[ for all i £ I . Then (p = 
mm{fjLi]i £ 1} is also non- decreasing in ] — oo^Xj] and non-increasing in [x/fc,-hoo[ . As (li are 
upper semicontinuous then (p is USC on IR, too, particularly on the compact interval [xj,Xk]. Hence 
p) = min{/ii;z G /} attains its maximum on [xj,Xk], which is also a global maximum on X. This 
maximum is a max-min decision, i.e. Xm ^0- ^ 

Example 4.I This example demonstrates that semicontinuity is essential in the above proposition. 
Let 

p,i(x) = 1 for X <0, 

fjii(x) = 0 for X >0. 

ji 2 [x) = e^ for a; < 0, 

pi 2 [x) = 1 /or a; > 0. 

Here, /xi,/i 2 are the membership functions of fuzzy criteria, p 2 is continuous, pi is not upper semi- 
continuous on IR. It is easy to see that 'ip{x) = min{/xi(x), /X 2 (a:)} does not attain its maximum on 
IR, i.e. Xm = 0- 

Proposition 4.2 Let pi, i £ I, be the membership functions of bounded upper semicontinuous 
(USC) fuzzy criteria on IR”. Then Xm 7^ 0- 

Proof. 

As /ii, i £ /, are bounded USC on IR”, then (p{x) = mm{pi{x)\i £ I};x £ X is bounded USC on 
IR”, too. Consequently, p) attains its maximum on IR”,i.e. Xm 7^ 0- ^ 
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Example 4-2 This example demonstrates that Proposition 4-1 does not hold in with n > 1, 
particularly, n = 2. Set 

Hi{xi,X 2 ) = max{0,l - 

X2-^l 
X\ ~~ X 

/i2(xi,X2) = max{ 0 ,l-{ — r)^}, 

X2 + I 

X = {{xi,X2) € R^;x2 > 0}. 

It can be easily verified that fii and P 2 continuous and concave fuzzy criteria on X, consequently, 
the same properties hold for 



<p(xu X2) = min{fii{xi,X 2 ), X2)}. 



It is not difficult to find that (p{xi,X 2 ) < 1 and 



(p{0, X2) = l- 



1 

{X2 + 1)2 



for (0,X2) 6 X. Since lim(^(0, 0 : 2 ) = 1 for X 2 -> + 00 , ip does not attain its maximum on X, i.e. 
= 0 • 



In what follows we investigate some relationships between Pareto-optimal decisions and max-min 
decisions. In the following proposition quasiconcavity and normality conditions of /Xj are not required. 



Proposition 4.3 Let fii,i e I, be membership functions of fuzzy sets on X = IR”. Then Xm C Xwp- 



Proof. 

Let X* G Xm- Suppose that x* is not a Weak Pareto-maximizer, then by Definition 3.1 there exists 
x' such that Pi{x*) < pLi{x'), for all i e I . Consequently, min{/Xi(a:*); z G /} < min{fXi{x')]i G /}, 
which shows that x* is not a max-min decision, a contradiction. □ 



Proposition 4^4 Let pi, i e I, be the membership functions of fuzzy sets on X = IR”, let Xm = 
{x*}, i.e. X* e X be a unique max-min decision. Then Xm C Xsp- 

Proof. 

Suppose that x* is a unique max-min decision and suppose that x* is not a SPM. Then there exists 
x'^ e X, X* ^ x~^, fjii{x*) < fjLi{x'^), i e I. Thus min{/Xi(a:*); z G /} < min{/xi(x'*'); z G /}, which is a 
contradiction with the uniqueness of x* G Xm- □ 



Proposition 4.5 Let pi, i £ I, be all strictly quasiconcave membership functions of fuzzy criteria 
on X — IR” and let x* G Xm such that 

0 < min{/Xj(a;*);z G /}. (10) 

Then Xm — { 2 :*}; i-e. Xm consists of the unique element. 
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Proof. 

Let ip{x) = mm{ni{x)\ i e 1} and suppose that there exists x' £ Xm, x* ^ x' . Then 

(p(x*) = (p(x') = S > 0. (11) 

As by (10) we have x*,x' € supp{ip), setting x"*" = Ax' 4- (1 — A)x*, for 0 < X < by strict 
quasiconcavity we obtain ip{x'^) > min{<^(x'), <^(x*)} = S, being a contradicttion with the fact that 
(p{x*) = max{v?(x); x G X}. □ 

Remark By Propositions 4-4 ^>nd 4-5, if x* £ Xm with (10), then x* £ Xsp. 

5. Generalized Compromise Decisions 

In this section we generalize the concept of the compromise decision by the use of triangular norms, 
see also [5]. 

Definition 5.1 A triangular norm t (briefly t-norm) is a function t : [0, 1] x [0, 1] — >> [0, 1], satisfying 
the following conditions: 

(i) t{a, 1) = a for every a £ [0, 1], (’’boundary condition”) 

(ii) t{a, b) < t{c, d) whenever a < c,b < d, a,b,c,d £ [0, 1], (’’monotonicity”) 

(iii) t{a,b) = t{b,a) for every a,b £ [0, 1], (’’commutativity”) 

(iv) t{t(a, 6), c) = t{a, t{b, c)) for every a,b,c £ [0, 1]. (’’associativity”) 

A t-norm is called strictly monotone if 

t{a,b) < t{c,d) whenever a < c and b < d. (12) 

Examples of t-norms. 

Let a,b £ [0, 1]. Define 

ti(a, b) = min{a, b}, (13) 

t2{a,b) = a.b, (14) 

ts{a, b) = max{0, a 4- 6 - 1}, (15) 

U{a,b) = min{a, 6} if max{a, 6} = 1, (16) 

= 0 otherwise. 



Then ti-norm is continuous and strictly monotone, ^ 2 -norm is continuous and strictly monotone, 
too, ta-norm is continuous, but not strictly monotone and t 4 -norm is neither continuous, nor strictly 
monotone. 
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Remark 5.1 By commutativity and associativity properties (Hi) and (iv) in Definition 5.1, t-norm 
may be extended to n-ary operation, e.g. a 3-ary operation t{a\,a 2 ,az) = t{a\,t{a 2 ,az)), etc. 

Definition 5.2 Let /Xj, z € /, he the membership functions of fuzzy criteria on X = IR”, t be a 
t-norm. A decision x* £ X is called a max-t decision, if 

t{fj,i{x*),...,firn(x*)) = max{t(pi(x),...,fj>m{x))\x e X}. (17) 

The set of all max-t decisions on X is denoted by Xt(/J>i, ...,/JLrn)^ shortly, Xt. 

In the previous section we have investigated some properties of the compromise decisions considering 
a particular t-norm t = min. In what follows we extend the results for more general t-norms. The 
following propositions generalize Propositions 42, 43, 44, and 45, the proofs are straightforward. 

Proposition 5.1 Let pLi,...,pLm the membership functions of USC bounded fuzzy criteria on X = 
IR”, t be a continuous t -norm. Then Xt^ it. 

The next proposition is an extension of Proposition 43. 

Proposition 5.2 Let /Xi,...,/Xm be the membership functions of fuzzy sets, t be a strictly monotone 
t-norm. Then Xt C Xwp- 



The following proposition is a generalization of Proposition 44. 

Proposition 5.3 Let ..., pim be the membership functions of fuzzy sets, let Xt = {x*},i.e. x* e X 
be a unique max-t decision, t be a t-norm. Then Xt C Xsp. 

The following proposition is an extension of Proposition 45. 

Proposition 5.4 Let /ii,...,/Xm be strictly quasiconcave membership functions of fuzzy criteria, t be 
a strictly monotone t-norm and let x* 6 Xt such that 

0 < mm{fjLi(x*)] i € /}. (18) 

Then Xt = {x*}, i.e. Xt consists of the unique decision x*. 

Remark 5.2 By the last proposition, for every x* G Xt with (18), it follows that x* G Xsp. 



Remark 5.3 The proofs of the above propositions may be obtained by adaptations the proofs of the 
corresponding propositions from the previous section. The proofs for the case m = 2, n = 1 can be 
also found in [8]. 
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6. Conclusion 

In this paper we have dealt with the multi-criteria optimization problem where the criteria have 
been modelled by fuzzy sets. We have investigated mutual relationships among 3 types of ’’optimal 
decisions” : Weak Pareto-Maximizers, Pareto-Maximizers and Strong Pareto-Maximizers - commonly 
called Pareto-optimal decisions. Moreover, we have studied the compromise decisions maximizing 
the minimal grade of satisfaction of the alternatives, called max-min decisions. The main subject of 
our interest has been focused on conditions of compromise decisions to be Pareto-optimal. We have 
also generalized the concept of the compromise decision by adopting triangular norms (t-norms) and, 
finally, extended the theorems derived for max-min decisions to the corresponding max-f decisions. 
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Single-Machine Scheduling with Mixed Precedence 

Constraints 

Eugene Levner, Center for Technological Education Holon 
Milan Vlach, Japan Advanced Institute of Science and Technology 

Summary: The paper deals with a single machine scheduling problem involving a general prece- 

dence structure that permits both ordinary and fuzzy precedence constraints. Feasible schedules are 
evaluated not only by their cost but also by the degree of satisfaction with their precedence structure. 
An 0(m log n + max(n^, A;n^))-time algorithm is proposed for finding nondominated solutions of the 
resulting bi-criteria scheduling problem where n is the number of jobs, k is the number of fuzzy 
constrains, and m is the total number of precedence constraints. 

Keywords: Machine scheduling, fuzzy precedence, polynomial algorithm. 

1. Introduction 

There are many scheduling problems of practical interest in which the input data are uncertain or 
imprecise, and their forms of uncertainty or imprecision are not of probabilistic nature. In situations 
where objective functions or constraints are neither deterministic nor probabilistic, the problems may 
often be modelled with the help of fuzzy sets. In this paper we study a single machine scheduling 
problem involving both ordinary (crisp) and fuzzy precedence constraints. Scheduling problems with 
such constraints occur quite frequently in modern automated systems. As example consider the 
following situations. 

Suppose we have a multi-machine robotic cell in which a robot (an automated guided vehicle) is 
responsible for loading/unloading and transporting parts between machines. The parts visit the 
machines, following given routing tables, thus generating demand for the robot. Various possible 
equipment layouts in such cells are presented, for example, in [1,8,12,13]. The due dates of robot 
operations within a production cycle are specified by a process engineer, according to specific require- 
ments of a production process. Further, the technological requirements impose that certain robot 
operations must precede some others. The decision maker is faced with the problem of finding a 
robot’s schedule subject to precedence constraints and minimizing some cost function depending on 
completion times of robot operations. 

Another example relates to a pick-and-place robot serving an automated warehouse. Suppose n 
items (orders) are waiting on their places in the warehouse for being sent to the customers. The 
pick-and-place robot has to serve them in turn, that is, to unload and transport them to the output 
one after another. Due dates and unloading/ transportation times are known for each item. Some 
items may be unloaded only after some others were, and these precedence constraints on robot’s 
operations are specified in advance. Again, the decision maker is to find a schedule for the robot 
such that the precedence constraints are satisfied and a given function of operation completion times 
is minimized. 

Our next example relates to the mission planning of a satellite (a space robot) [3,4]. Consider a 
satellite following orbits around the Earth in order to take shots corresponding to images requested 
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by various customers. One of the decision problems to be solved is the daily task planning which 
consists in sequencing of tasks (shots) to be performed by the satellite during a planning period. 
Due dates and operation durations are known for each task. Precedence constraints between the 
tasks that range them according to their scientific, military, political or commercial importance are 
determined by experts and specified in advance. Since all the images requested cannot usually be 
satisfied in time, a penalty function is given for each task the value of which depends on time when 
the task is performed. In order to observe the interests of all customers, a shot sequence minimizing 
the maximum penalty (or some other function of penalties) is to be found. Often the quality of 
the resulting shot sequence must be evaluated by taking into account this criterion as well as the 
satisfaction of the decision maker with the task ordering. 

In all examples given above, the precedence constraints may be crisp or fuzzy. The crisp precedence 
constraints describe the ordering between the operations which is not liable to any change under any 
circumstances (for example, it is a given technological order) while the fuzzy precedence constraints 
describe a more fiexible situation where some pairs of operations are not linked by technological 
constraints but the decision maker prefers one ordering to another according to her or his intuition 
or experience. The choice of an order among these two possible alternatives for each pair of such 
operations is a decision variable. As we shall see below, this choice may be carried out in such a 
way that the degree of satisfaction of the decision maker with a schedule is maximized. The exact 
meaning of this criteria will be explained in the next section. 

In the case when all precedence constraints are crisp, we obtain a classical single-machine n-job 
scheduling problem for which a great number of results is reported in the scheduling literature. 
Depending on the specific form of the objective functions and constraints of the problem, many 
versions of the problem are known to be polynomially solvable while a lot of others are proved to be 
NP-hard (see, for example, the extensive survey [10]). However, not too much is known about this 
scheduling problem when fuzzy precedence constraints are present. One of the papers dealing with 
such constraints is a recent paper by Ishii and Tada [6]. Ishii and Tada deal with a single machine 
problem of minimizing maximum lateness in the presence of fuzzy precedence constraints. They 
proposed an efficient combinatorial algorithm solving the problem in cubic time (with the problem 
size being measured in the number of jobs). 

In this paper, we extend the above model in the following two aspects. Firstly, we deal with 
minimizing the maximum of arbitrary nondecreasing cost functions of completion times. Sec- 
ondly, we consider a rather general precedence structure which includes both crisp and fuzzy prece- 
dence constraints. Following the Ishii-Tada approach, we take the degree of satisfaction of the 
decision maker with respect to the precedence structure as the second criterion, and propose an 
0(m log n -h max(n^, /cn^))-time algorithm for finding nondominated solutions of this bi-criteria 
scheduling problem, where n is the number of jobs, k is the number of fuzzy constrains and m 
is the total number of crisp and fuzzy constraints. Note that k or even m may be zero. 

In the next section we present the problem formulation. In Section 3 we present a new strongly poly- 
nomial algorithm for finding nondominated solutions of the resulting bi-criteria scheduling problem. 
In Section 4 we analyze the algorithm and obtain, as ’by-products’, refinements of the Lawler and 
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Ishii-Tada algorithms. Section 5 summarizes the results of the paper and discusses some directions 
for further research. 

2. Problem formulation 

We consider the following scheduling problem with mixed (crisp and fuzzy) precedence constraints. 
There are n jobs Jj, j = 1, . . . , n, each with a specified processing time pj, to be scheduled for pro- 
cessing by a single machine that can work on at most one job at a time. The machine is continuously 
available and no preemption is permitted, i.e. each job Jj must be processed continuously for time 
Pj. To avoid computational complication we assume that all processing times are positive rational 
numbers. In addition to the values of Pj, the following two types of precedence constraints may be 
given: 

1. Crisp constraints. These constraints are given by an irrefiexive partial order on the set V = 
{1,2, ...,n}, which is usually derived from an acyclic digraph G = {V^E) where each node 
j £V represents job Jj and each directed path from j to k represents the requirement that job 
Jj has to be completed before the processing of job Jk can start. 

2. Fuzzy constraints. These constraints do not restrict the feasibility of a schedule by requir- 
ing that certain jobs must precede some other jobs. Instead, they refiect the decision maker 
preferences between possible orders. Each fuzzy precedence constraint is specified by an un- 
ordered pair of different jobs, say U = {JuJj} and a membership function pu defined on the 
corresponding two-element set of ordered pairs Jj), (Jj, J»)}. The values pu(JijJj) and 
Pu{Jjy Jj) express the degree of satisfaction when job Jj precedes job Jj and when Jj precedes 
Jj, respectively. We assume that each membership function pu is normal and unimodal in the 
sense that one of the values pu{Ju Jj) and PuiJjt Ji) is 1, and the other is strictly between 0 
and 1. 

It is natural to require that fuzzy precedence constraints cannot be imposed on the pairs of jobs 
subjected to crisp constraints. In other words, we assume that exactly one of the following three 
variants is associated with every ordered pair of different jobs Jj and Jj i Either Jj and Jj are subject 
to a crisp constraint, or Jj and Jj are subject to a fuzzy constraint, or Jj and Jj are independent. 
Throughout the paper we consider only nonpreemptive schedules without inserted idle time. Such 
schedules are completely characterized by permutations of the set of job indices (1, 2 , . . . ,n}, and 
we sometimes identify schedules with permutations of (1, 2, . . . , n}. However we should notice that 
only the permutations satisfying all crisp constraints are considered to be feasible. Moreover we 
are not completely free in selecting the processing order of two jobs which are subject to fuzzy 
constraints, because some choices may be inconsistent with given crisp constraints. To avoid possible 
misunderstanding we say that a permutation tt (or schedule tt) of {1, 2 , . . . , n} is compatible with a 
directed graph with node-set {1, 2, ... ,n} if job Jj precedes job Jj (not necessarily immediately) in 
the schedule determined by tt whenever (i, j) belongs to the transitive closure of the graph. 

The quantitative measures for evaluating schedules we are interested in are based not only on the 
degrees of satisfaction expressed by the membership functions of fuzzy constraints but also on cost of 
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schedules expressed by suitable functions of job completion times. Consequently, we assume that a 
nondecreasing cost function fj is associated with each job Jj whose value fj{Cj) determines the cost 
incurred when job Jj is completed at time Cj. As a result we obtain bi-criteria scheduling problem. 
One objective is to minimize the maximum cost and the other is to maximize the minimum degree of 
satisfaction with the job order. Before proceeding to an exact definition we simplify the notation by 
writing instead of Jj)j and notice that these values are given only for pairs of jobs subject 

to fuzzy constraints. We define them also for all remaining pairs of jobs by setting: = 1 and 

fjiji = 0 for each pair (i^j) belonging to the transitive closure of the crisp precedence graph G, and 
/jLij = fjLji = 1 for each pair of independent jobs Jj and Jj. 

For each permutation tt of {1, 2, . . . , n} that is compatible with all crisp constraints, we define 

( 1 ) 

/maiW = jmM/j(Cj(7r)) (2) 

where Cj{n) denotes completion time of job Jj under the permutation schedule given by tt. 

Since it may happen that no feasible schedule optimizes fmax and ^irnin simultaneously, we are inter- 
ested in finding schedules which are nondominated with respect to the following dominance relation: 
A schedule tti is said to be dominated by a schedule 7T2 if /mai(7Ti) > fmax { 7 ^ 2 ) and /imin(7Ti) < Mmm(7T2) 
and at least one of these two inequalities is strict. If tt and tt' are such that fmaxM = /max(Tr') and 
Mmm(7r) = then we call them equivalent. Consequently, every equivalence class of schedules 

consists of all schedules with identical value of fmax and identical value of fimin- 
Now the problem we are addressing in this paper can be formulated in the following way. 

Problem P: Find one schedule from each equivalence class of nondominated feasible schedules. 

Let us note that this problem includes several known scheduling problems as special cases. For 
example: 

• If only crisp precedence relation is imposed, then we have the problem studied by Lawler [9]. 

• If only fuzzy precedence constraints are presented and each cost function is the lateness, then 
we have the problem studied by Ishii-Tada [6]. 

• If all precedence constraints are absent and cost functions are membership functions measuring 
the degree of dissatisfaction with completion times, then we have the problem with fuzzy due 
dates studied for example in [5,7,14,15]. 

3. Problem analysis and algorithm 

In what follows Gq stands for a digraph defined for each 0 < a < 1 as follows: If a = 1, then 
Ga = Gi is the digraph obtained from the crisp precedence graph G by adding to it all arcs (z, j) 
corresponding to the fuzzy constraints with /iy = 1. If 0 < a < 1, then Gq is the digraph obtained 
from Gi by deleting all arcs (z, j) such that /Zy = 1 and /Zjj > a. Note that graph Gq may contain 
directed cycles for some values of a. 
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For ease of description of solution procedures for problem P, we consider also the following auxiliary 
problems where 0 < a < 1: 

Minimize /max(7r) 
subject to /Xmin(7r) > a, 

7T is compatible with Gq 

As pointed out in the previous section, Ishii and Tada [6] proposed an 0(n^)-time algorithm for 
solving problem P in the case that no crisp constraints are present, graph G\ is acyclic, and fmax is 
the maximum lateness. For reader’s convenience we recall the basic steps of their algorithm. 

ISHII-TADA ALGORITHM 
Step 1 [Initialization] 

1.1 Arrange all from the open interval (0, 1) in a nondecreasing order and rename the resulting 

k different values so that 1 > > /lz(2) > • • • > ^{k) > 0. 

1.2 Set q := 0, /i(^) := 1, a := /i(^), L := 0. 

1.3 Construct Ga as described above. 

Step 2 [Solving Pa] Minimize the maximum lateness subject to the precedence relation given by 
Gq. If the resulting solution tTq is not dominated by any schedule from L and tTq is not yet in 
L, then include it in L. 

Step 3 [Arc deletion] ' U q < k, then set := ^ + 1, a := fi{q), construct Ga by deleting from 
G^(g-i) all arcs (ij) with fiji = a, and go to Step 2. If q = k, then stop (the current L 
provides the required set of nondominated solutions). 

The extension of the Ishii-Tada approach to the general case proposed in this paper is based on the 
following four observations. 

Observation 1 Solution of problem Pa in the second step of the Ishii-Tada algorithm can be im- 
plemented in polynomial time not only for the maximum lateness but also for an arbitrary function 
fmax defined by (2), provided all fj are nondecreasing and their values can be computed in constant 
time. 

Proof. The well known algorithm of Lawler [9] provides the required possibility. To obtain a concise 
description of this algorithm we use the following notation. Let the precedence relation under con- 
sideration be gi6en by an acyclic digraph G = (V, E). Without loss of generality we assume that V 
is the set of the job indices {1, 2, . . . , n}. For each subset / of V, let Gj be the digraph obtained by 
restricting G to I. Furthermore, for each i G /, let T»(G/) denote the set of all nodes in / preceded 
(not necessarily immediately) by node i, and let Ei{Gi) denote the set of all arcs of G/ entering node 
i. Now we are ready to describe the algorithm. 
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LAWLER’S ALGORITHM 

I:=V^ A:=E, u:=Y.ieiPi 
For j = n down to 1 do: 

Choose i G / such that 
fi{u) = min{je/|rj(G/)=0} fj{u) 

I:=I\{i),A-A\Ei{Gj) 
u := u — Pi 
n{j) := i 
output (tt) 

The algorithm determines the index 7 t(z) of the job in the i-th position of an optimal permutation 
schedule, and the analysis presented in [9] shows that the algorithm can be run in 0(n^ )-time. □ 

Observation 2 If graph Ga contains a cycle, then problem P has no feasible solution tt with 

Proof, (by contradiction). Assume that problem has a feasible solution tt, and that graph Gq 
has a directed cycle C. Rename the nodes of V so that tt = (1, 2, . . . , n). Let C = {vi,V 2 j . . . ,Vt,Vi) 
where t < n. Let u denote the smallest node in C, and w the node standing just before u in C. [For 
instance, if tt = (1, 2, 3, 4, 5, 6) and C = (6, 5,3, 6) then u = 3 and w = 5]. Consider two possible 
cases. 

Case A. Suppose that arc {w,u) of C (in graph Go) corresponds to a crisp constraint. 

According to the definition of the crisp constraint, it means that job must precede job Ju in all 
feasible schedules. On the other hand, any feasible permutation (and tt as well) determines uniquely 
an order of processing the jobs. Since, in accordance with our choice, u < li; in tt, then Ju is to 
be processed before in schedule tt. This condition, taken together with the above one that 
precedes Ju^ contradicts our basic assumption that the crisp precedence graph G = (V, £") is acyclic. 
Case B. Arc {w,u) in Ga corresponds to a fuzzy constraint. 

According to the construction of G\ and Gq, pu},u = 1- However, Ju precedes Juj in tt (because of 
u < w). Therefore, puw will be participating in defining the value /Xmin(7r) according to (1). Recall 
now that graph Gq is constructed by deleting from G\ all arcs with pji > a, leaving only those (z, j) 
for which pji < a. Then prnini'^) < Q:, which contradicts feasibility of tt for problem Pq. This proves 
the claim. □ 

Observation 3 If graph Gp contains a cycle for some 0 < ^ < 1, then the maximum value of a in 
the interval 0 < o; < ^ for which Gq is acyclic can be found by a binary search. 

Proof First we not)ce that if problems Pc. and Fq 2 with 0 < < 02 < are such that the 

corresponding graphs Gq^ and Gqj have cycles, then also the graph Gq corresponding to Pa contains 
a cycle whenever 0 < oi < a < 0 : 2 . This is obvious because Gq^ is a subgraph of Gq for each 
0 < oi < a < ^. It remains to show that there exists a in the interval {0,13) such that Gq is 
acyclic. According to the assumption, graph Gp contains a cycle. It follows that Gp contains arcs 
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corresponding to fuzzy constraints, because the graph defining the crisp constraints is assumed to be 
acyclic. It follows, also from the acyclicity of the crisp precedence, that we can obtain an acyclic graph 
Got with 0 < a < by deleting from sufficiently many arcs corresponding to fuzzy constraints. 
Now, if graph Gp contains a‘cycle we can arrange all Hij values from (0, 1) corresponding to its fuzzy 
constraints in decreasing order. Obviously the number of different /Xy values is not greater than 
m, the total number of all precedence constraints. Since graph G is an acyclic subgraph of Gp, 
the standard binary search will identify the maximal fiij for which the graph G^.j is also acyclic in 
O(logn) time. □ 

Observation 4 If graph G represents crisp precedence constraints and if Gp is acyclic for some 
0 < /? < 1, then there is a generalization of the arc deletion procedure of the third step of the 
Ishii-Tada algorithm such that it can be applied for a transition from Pp to Pq with 0 < a < p. 

Proof. Consider an arbitrary feasible schedule tt for problem Pa with 0 < a < /?. Let jobs Ji and 
Jj be such that the arc (i,j) belongs to Gp, and = 1 and fiji > a. If job Jj is scheduled before 
job Jj in 7 T, then there are k and I such that A; < Z, 7r{k) — x, and 7 t(Z) = j. If job J* is scheduled 
after job Jj in tt, then there are k and I such that k < I, 7r(k) = j, and n{l) = i. It follows that 
in both cases > ck- However, as Ishii and Tada noticed in [6], this implies that an arc (x, j) 

can be discarded from Gp without violating the feasibility condition /Jimin{T^) > Moreover the 
minimum of fmax subject to the precedence constraints given by Gp \{{i,j)} cannot be smaller than 
the minimum of fmax subject to Gp^ because Gp \ {(x, j)} is a subgraph of Gp. This argument can 
be repeated iteratively for all arcs (x,j) of Gp for which iXij = 1 and /Xjj > a, as required by the 
Ishii-Tada arc deletion procedure. □ 

Algorithm Description 

Step 1 [Initialization] 

1.1 Arrange all /x^ from the open interval (0, 1) in a nondecreasing order and rename the 
resulting k different values so that 1 > /x(l) > /x(2) > • • • > /x(A;) > 0. 

1.2 Set q := 0, /x(^) := 1, a := /x(^), L 0. 

1.3 Construct Ga by adding to the crisp precedence graph all arcs (x, j) corresponding to the 
fuzzy constraints with /x^ = 1. 

Step 2. Check if graph Ga is acyclic. If yes, go to Step 5; if no, go to Step 3. 

Step 3. [Binary search] Find x such that /x(x) is the greatest value from among 
/x(l),/x(2), . . . ,/x(A;) such that the corresponding graph is acyclic. Set a := /x(x). 

Step 4. [Solving Pa] Solve problem Pa. If the resulting solution 7Ta is not dominated by any 
schedule from L and TTa is not yet in L, then include it in L. 

Step 5 [Arc deletion] li q < k, then set ^ := g + 1, a := /x(^), construct Ga by deleting from 
all arcs (x,j) with fiji = a, and go to Step 4. If q = k, then stop (the current L 
provides the required set of nondominated solutions). 
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4. Analysis of the algorithm 

In what follows we will assume, without loss of generality, that all fuzzy arcs have different member- 
ship function values. If some arcs have identical values, for example, /i(l) = /x(2) = • • • = //(d), 
then we let = ^x{i) + ci, for i = 1, . . . , d, where c > 0 is a small number and we use ei to break 
ties. 

Theorem The time complexity of the algorithm is 0(m log n + max (n^, A;n^)) where k is the number 
of fuzzy constraints, m is the total number of precedence constraints, and n is the number of jobs. 
Proof Step 1.1 requires 0(A;logn) operations. Step 1.2 and Step 1.3 can be run in 0(m) time. Step 
2 can be done in 0{m) time (see, for example [2]). Step 3 is run no more than logm times, each 
time requiring 0{m) operations. Step 4 requires O(n^), according to the complexity of the Lawler 
algorithm [9] when the scheduling problem is solved from scratch. Then this step is repeated at 
most A; -f 1 times, each time requring O(n^) operations. Consequently the total time for this step is 
0(A:n^). Step 5 requires at most 0(m) time. Thus, the overall time complexity of the algorithm is 
0(mlogn + max(n^, A;n^)). □ 

Corollary 1. {Refinement of the Lawler-Moore algorithm for Lmax)- The time complexity of mini- 
mizing the maximum lateness subject to crisp precedence only is 0{m + nlogn). 

Proof Note that 0{m) time is needed to construct graph TG, the topologically sorted representation 
of the directed acyclic graph G [2]. Let 1,2, ...,n be the new numbers of nodes of G, linearly 
ordered in TG. According to the Lawler-Moore algorithm [11] for minimizing the maximum lateness 
Lmax = maxj(Cj — dj), we need to determine and sort the modified due dates: 

d' = min{dj, min{di| J. G Tj}}, j = 1, . . . , n, 

where Tj is the set of all successors (not necessarily immediate) of j in TG. Let Wj be the set of 
immediate successors of j in TG and Nj be the number of nodes in Wj. Since TG is topologically 
sorted, we can rewrite the definition of d' as follows: 

dj = min{dj, mm{di\Ji e Wj}}, j = l,...,n. 

Let us calculate djj, d'^_^, . . .,d[ in this order. In order to know djj we now need 0(1) time. Further, 
in order to know djj_i we need 0{Nn-i) operations, to know djj _2 we need 0{Nn-2) operations, and 

so on, till we find d[ using 0{Ni) operations. The total time is 0{Ni H h Nn-i) = 0{m). To sort 

the d', we need O (nlogn) time, and this proves the claim. □ 

The estimation refines the Lawler-Moore result [11] in practically important cases when m is less 
than n^, and coincides with it in the case when m is close to n^, for example, if G is complete. 
Corollary 2. {Refinement of the Ishii-Tada algorithm). If G\ is an acyclic digraph in which the 
maximal distance between pairs of its nodes is H, then the algorithm can be run in 0{m\ogn-}- Hk) 
time. 

Proof When applying the above algorithm to the Ishii-Tada problem. Steps 2 and 3 are not exploited, 
while Step 4 requires now only 0{m) operations when the problem is solved from scratch. In addition, 
0{Hk) time is needed, because 0{H) operations are applied to recalculate the modified due dates, 
the latter recalculations being repeated 0{k) times, each time for a new problem Pq. □ 
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The estimation refines the Ishii-Tada result in [6] when m is less than n^, and coincides with it when 
m is O(n^). As an obvious consequence of this corollary, we obtain the following result. 

Corollary 3. ( The case of planar graphs). If, in addition to the assumption of the previous Corollary, 
the precedence graph is planar, then the problem can be solved in 0(nlogn -f Hn) time. 

5. Concluding remarks 

This paper introduces a new practically important class of one-machine scheduling problems, the 
so-called problems with mixed precedence constraints, which include both crisp and fuzzy prece- 
dences. It is shown that the two types of precedences not only have different physical nature but 
also require a different algorithmic treatment in the scheduling problems. The scheduling problem 
under consideration is formulated as a bi-criteria combinatorial optimization problem, and a set of 
nondominated solutions is obtained in strongly polynomial time. Examples of applications in robotic 
automated systems are presented. 

Several extensions of this study may be of practical value. First, it is certainly possible that the 
polynomial algorithm presented in this work can be further improved. Other extension are concerned 
with applying the suggested approach for fuzzy scheduling in several practice-oriented directions: (a) 
more than one machine may be treated with other criteria and scheduling scenarios, like job shop, 
open shop, cyclic shop or general shop, (b) other types of fuzzy constraints and fuzzy input data could 
be involved, (c) synchronization and coordinated scheduling of machines and automated material- 
handling devices (robots) in flexible manufacturing systems are of special interest. 
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Training of Large Three-Layer Perceptrons 
with the Neuro-Computer SYNAPSE 
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Summary: When real-life applications are treated, neural networks are typically computation- 

ally expensive. Here the emphasis is on supervised learning with three-layer perceptrons found 
in many successful applications, both research and commercial. Matrix arithmetic learning algo- 
rithms, e. g. backpropagation or momentum learning, are well-suited for fast execution on com- 
puters with matrix processors. An efficient implementation for the low budget matrix- and neuro- 
computers SYNAPSE 2 and SYNAPSES is outlined. The SYNAPSE 2/3 has a peak performance of 
1.28/2.56 billion multiplications (16 bit fixed-point) or additions (48 bit fixed-point) plus 40 million 
look-up-table calls (16 bit fixed-point) of predefineable functions per second. A comprehensive per- 
formance analysis for the SYNAPSE 2 shows that a significant speedup of a today’s PC’s compute 
power is possible. 

Keywords: Large three-layer perceptrons, supervised learning, neural matrix arithmetic algorithms, 
neural network parallel computing, inexpensive matrix- and neuro-computers, SYNAPSE 2 and 
SYNAPSES. 



Three-layer perceptrons with shortcuts 

Today three-layer perceptrons are daily used for real-world applications, e. g. by renowned interna- 
tionally active financial institutions for 1-52 week EURO STOXX index, DAX index, REX index, 
exchange rate and stock price prediction. We consider three-layer perceptrons, see [1], [13] and 
[16], with shortcuts, with an auxiliary neuron Nq (bias neuron) with Xk,o := 1, with input neurons 
Ni - iVni, with hidden neurons N^+i - A„i+n2» with output neurons Nm+n 2 +i ~ ^ni+n^+m with 
hyperbolic tangent transfer functions 

^k,i •= perceptron input Xk,i for z = 0 , 1, . . . ni, 

/ ni 

Xk,i := tanh ^ 

\j=o 

( »»i+n2 \ 

^ Xkj Wj^i for i = ni H- ri2 + 1, . . . Til + ri2 + ^13. 

) 

For k = 1,2, ...rip the pattern input including bias and output are denoted by Xk G and 

yk € R”®, respectively. Often only one output neuron is advisable to facilitate a perceptron’s 
training (separate perceptrons in the case of more than one output variable). All weights wj^i of the 
weight matrices 







■ * ^ni,ni+l ^ 




^fii+l,ni+fi2+l ^ni+2,fii+ri2+l 


* * ‘^ni+n2,ni+n2+l ^ 


Wn = 


^o,m+2 Wi^m+2 * 


’ * ^ni,ni+2 


, W23 = 


^m+l,ni+n2+2 '*^ni+2,ni4-n2+2 * 


+T 12 , ni +TI 2+2 




\^0,ni+n2 '^l,ni+n2 * 


■ *‘^ni,ni+n2/ 




\‘^ni+l,ni+n2+n3 ^ni+2,m+n2+n3 * 


* *'*^ni4-n2,ni+n2+n3/ 



Xkj I for 2 = Til -h 1 , . . . Til H- ri2 and perceptron output 



( 1 ) 
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Wi2 e (layer 1 layer 2 ), W23 6 R”’-"’ (layer 2 layer 3 ) and 



^3 = 



^0,ni+n2+l 

'W^0,rn+n2+2 



^I,ni+Ti2+1 ’ * ‘ ‘*^ni,m+n2+l \ 

^l,ni+n2+2 * * * ^ni,ni+n2+2 I 



V^ 0 ,ni+n 2 +n 3 '^I,ni+n 2 +n 3 ’ * *'^ni,ni+n 2 +n 3 / 



Wi3 e R”3,ni+1 (shortcuts), are trainable. The pattern set P := {(®i,yi), («2,y2)» • (®np, 2 /np)} 

is divided appropriately into training set Pt and cross-validation set P^ with index sets It and 
{PtUPv = P and PtnPv = 0 ). With W := {Wu, W23, W13) and th and defined as the number of 
training and cross-validation patterns the approximation quality of a perceptron can be estimated 
with the error functions 



m + n 2 +n 3 



(W) = - X! “ yk,i-m-n2? and 



kelt /=ni + n2+l 






ni + n 2 +n 3 



kel« /=ni+n 2 +l 



( 2 ) 

( 3 ) 



First derivative training methods and matrix arithmetic algorithms 

As usual a perceptron is trained iteratively, i. e. et{W) is decreased by adaption of VF, as long as 
e„{W) < et(W) or s*(W^) « £t(W^) holds (prevention of overtraining). Upgraded weights can be 
computed with usually favored first derivative methods 

Wnew = Wold + »? ^Woid, ^ £t (Wold) j AW ^Woid, ^ £t (Wold) j (4) 

with step length 77, e. g. the method of steepest descent with or without momentum term. For the 
elements of the gradient ^£t(W) there holds 



d 

dwij 



ni + n 2 +n 3 Q 

St(W) = i^k,l — ykj-m-m) ^k,l 

k£lt /=m + n 2 +l 

ni+r» 2 +n 3 \ ^ f 

= ~ yk,l-ni-n2) (1 “ ^k,l) o { zL '^m,l^k,m 

k^It /=ni+n2+l ym ni+1 

ni + n2+Ti3 / 2\ ^ 

= {^k,l — yk,l-ni-ri2) (} ~ ^k,l ) o ^k,j 

keh /=ni + n2+l 

ni + Ti2+n3 o\ / o\ 

= i^kyl ~~ yk,l-n\-n^ ~ ^k,l J '^j,l ~ ^kj J ^k,i 

kElt f=ni4-n2+l 



( 5 ) 



for i = 0, 1, . . . ni and j = ni + 1, rii + 2, . . . ni + TI2, and analogously 



— £t{^) — 2/fc,j-ni-Ti2) (1 ^k,j ^ ^kyi (^) 

k£h 

for i = 0, 1, . . . ni, ni + 1, «i + 2, . . . ni + ri2 and j = ni + fi2 + 1 , ni + na + 2 , . . . «i + TI2 + 713. When 
real-life applications are treated, neural networks are typically computationally expensive 

• because the dimension ni and/or 713 of the input and/or output vector is large and/or 
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Figure 1. SYNAPSE 2 PCI coprocessor board with one MA16 matrix processor, 8 + 16 
MByte on board memory and « 20 MByte/sec data transfer rate SYNAPSE 2 PC. 



• because the number ri 2 of hidden neurons necessarily is large and/or 

• because the number rip of patterns is huge and/or the pattern set must be split appropriately 
into subsets which are used to train different neural (sub-)networks, see [3] and [5], and 

• because no convergence to the global minimum of the error function €t{W) can be ensured and 
the training with very many random weight initializations is necessary. 

The optimization of percept ron topology and weights generally requires the training of hundreds, 
thousands or ten thousands of networks. Different topologies, different weight initializations and 
different step lengths must be tried. The more perceptrons are trained, the more likely it is to train 
at least one of the perceptrons successfully, i. e. to find a good local minimum of et{W). The com- 
puting power today can be provided by high performance computers, e. g. SIMD and MIMD parallel 
computers and specifically developed analog and digital neuro-computers. In contrast to parallel 
computers PC neuro-computer boards are rather inexpensive. Common for all training and evalua- 
tion algorithms for perceptrons on digital neuro-computers is the parallel computability of first (and 
second) partial derivatives of £:t(VF) w. r. t. wj^i, see [4], [8], [10], [11], [15] and [18]. Here we focus 
on the SYNAPSE 2 and SYNAPSE 3 PCI coprocessor boards — SYNAPSE = Synthesis of neural 
algorithms on a parallel systolic engine — with one and two Siemens MA16 matrix processor(s) and 
C++ class library SynUseBase, see Figs. 1 and 2, [9], [12] and [14]. One SYNAPSE 3 board includ- 
ing the SynUseBase library is available for less than DM 16000. SynUseBase requiring advanced 
C++ programming and knowledge of training algorithms enables extremely fast and flexible imple- 
mentations for many types of neural networks, see [2], [4], [5], [6], [7], [9] and [14]. The SYNAPSE 2/3 
has a peak performance of 1.28/2.56 billion multiplications (16 bit fixed-point) /additions (48 bit fixed- 
point) plus 40 million look-up- table calls (16 bit fixed-point) of predefineable functions per second. 
For multiplication and addition of large (sub-) matrices and adjacent evaluation of few predefined 
functions the SYNAPSE 3 is up to 60 times faster than a PC with 300 MHz Pentium II processor. 
Therefore any algorithm that uses matrix arithmetic is well-suited for fast execution. Besides image 
processing, neural network applications with large networks and/or large high-dimensional pattern 
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Figure 2. SYNAPSE 3 PCI coprocessor board with two MA 16 matrix processors, 16 4 - 
16 MByte on board memory and « 35 MByte/sec data transfer rate SYNAPSE 3 ^ PC. 

sets can be treated much more efficiently. To compute £t{W) and ^ £t{W) with a matrix arithmetic 



algorithm we define 



h — {^1)^ 

with the 







^ ^t2>0 






\ = 


^t2>l 








^i2,ni * 
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€ R"‘+ 1 '"S y € 



, Y-={ynyn -Vi.) = 



( Viui Vi2,i ■ ■ ■ 3/i„„i 
Vii,2 2/t2,2 • • • Vint, 2 



Kvii ,n3 2/i2,n3 * * * yinti^sJ 

5 the forwardpropagation, see Eqs. ( 1 ) 



A2 
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= tanh {N2) = tanh {W12 Ai) — 





1 tanh 1 
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tanhl E wi,ni+2Xit,i 
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1 tanh 1 
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• • • tanh 1 


/ni 
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[tanhl ^i,ni+Ti2 

\ \l=0 


^ tanh^ 


'ni \ 
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U=0 / 


• • • tanh ^ 
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./=0 * 



A2, A2 G and analogously compute 

^ ^tl,ni+n2 + l ^t2>»»l+W2 + l * * * ^*nf,Wi+n2 + l ^ 
. ^ii,ni+n2+2 ^i2,ni+n2+2 ‘ ' ’ ^int,T»i+n2+2 

Az— , . . 



= tanh (A3) = tanh (W23 A2 + IY13 Ai), 
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Ns, As € We compute the perception’s error for the training patterns 

/ ^»i,ni+n2+l “ ^t2,ni+n2+l ■” 2/t2,l *'* ni + 112 + 1 ~ 2/int,l ^ 

^*i,ni+n2+2 ■“ 2 /»i,2 ^i2,ni+ri2+2 ~ 2/*2,2 ’ * ' ^int,ni+n 2+2 ~ 



Ss:=As-Y = 



J 



\^ti,ni+n2+n3 2/*i,n3 ^*2,ni+n2+n3 J/»2,n3 *** 1 + 112+113 Vint^nz J 

Ss € and compute the training error function, see Eq. ( 2 ), with the elementary matrix 

multiplication (denoted by O) 

^ (^ti,m+n2+l 1 1» 1+112 + 1 2/int,l) ^ /l\ 



e,(M/) = J^( 53 © 53 )l = (n---^) 



1 

w 



\(^*l,ni+n 2 +n 3 2/11,113) * ' ' 111+112+113 l/tntjlls) / 

\ e R”® and 1 € R”*. The cross-validation error Sv{W), see Eq. ( 3 ), is analogously computed 
replacing the training patterns in A\ and Y by the cross-validation patterns and using € R”^ 
and 1 G R”’'. For the matrix arithmetic backpropagation, see Eqs. ( 5 ) and (6), we use the elementary 
derivative tanh' = 1 — tanh^ of the hyperbolic tangent elementary matrix function and compute 



D, 


= tanh' (iV3) , 


Z ?3 € 


Ez 


= SsQ)Ds, 


Es € 


S 2 


= Wzz'^Ez, 


52 e R"”"‘, 


D 2 
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E 2 


= S 2 ©i^ 2 , 


E2 e R"^-”', 



and thus can compute 
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Standard backpropagation learning with a priori chosen step lengths 7712,7/13,7723 G R yields the 
upgraded weight matrices 



1 ^ 12 , new •= W^12,old ~ ll 2 G\ 2 , Wi 3 ,new W^ 13 ,old “ ^13 ^ 13 , ^ 23 , new ^ 23 , old “ V 23 G^ 23 - ( 7 ) 
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A comprehensive performance analysis for the SYNAPSE 2, see Tab. 1 and [ 2 ], has been done for all 
perceptron topologies with ni + 1 = 2, 4, 8, 16, 32, 64, 128, ri 2 = 4, 16, 64, 256 and nz = 1, 2, 4, 16, 64. 
For rit = 1000 training patterns 500 weight upgrades, see Eq. (7), have been computed. On a PC with 
300 MHz Pentium II processor the perceptrons have been trained with the Stuttgart Neural Network 
Simulator SNNS as an example of a widely used public domain simulator with graphical interface. 
The SNNS provides an easily portable and highly flexible simulation environment for research on 
and application of neural networks, see [17]. In summary, we found 

• that the speedup mainly depends on the numbers rii and ri 2 of input and hidden neurons, 

• that a satisfactory speedup only is possible for very large perceptrons, 

• that the peak performance has been achieved only for the multiplication of large matrices, but 
by far not for elementary matrix functions, e. g. + and O, and 

• that the data transfer rate of « 20 MByte/sec does not allow efficient distributed computing. 

For many supervised learning problems perceptrons with only one output neuron and only few 
hidden neurons are advisable. Then 4, 8 , 16, 32 or 64 identical perceptrons with different weight 
initializations should be combined to one large perceptron using dummy weights in W 23 . These 
dummy weights are “frozen” to 0, i. e. they must not be trained. The combined large perceptron 
has rii H- 1 input neurons, 4 722 , 8 ri 2 , 16ri2, 32ri2 or 64 722 hidden neurons and 4723 , 8723 , 16 723 , 32 723 
or 64 723 output neurons, respectively. “Frozen weights” can be assigned with a SynUseBase library 
function. Due to two MA16 matrix processors the newly developed SYNAPSE 3 has a doubled 
peak performance of 2.56 billion multiplications or additions plus 40 million look-up-table calls. The 
increased data transfer rate of « 35 MByte/sec between SYNAPSE 3 board and PC now enables 
faster distributed computing (elementary matrix functions). Up to three parallel SYNAPSE 3 boards 
efficiently can be used in one PC. First results in the first authors group are very promising. 

Conclusions 

Computationally expensive large three-layer perceptrons can be trained and evaluated fast with the 
inexpensive neuro-computer PC-boards SYNAPSE 2 and SYNAPSE 3. The error function and its 
gradient are computed with matrix arithmetic algorithms which enable a peak performance of more 
than 10® fixed-point operations per second. This performance is comparable to the performance of 
today’s parallel supercomputers with 20 - 30 processors. The comparison between the SYNAPSE 2 
and a PC with 300 MHz Pentium II processor shows a maximum speedup by the factor 30. Nev- 
ertheless for three parallel SYNAPSE 3 boards a speedup by the factor 100 is possible. The first 
author’s current SYNAPSE 3 research projects are dedicated 

• to an efficient distributed computing for the PC’s processor and up to three parallel SYNAPSE 3 
boards, 

• to the fast gradient computation for advanced training methods, e. g. sequential quadratic 
programming (SQP) methods and generalized Gaufi-Newton (GGN) methods, and 

• to affordable personal neuro-computer applications, e. g. for financial time series analysis and 
forecast and heuristic option pricing with neural networks. 
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Topology 


SYNAPSE 2 


300 MHz Pentium II 


# 


Til + 1 


ri2 


^3 


time [sec] 


time [sec] 


speedup 


1 


8 


256 


1 


211 


592 


2.81 


2 


32 


64 


1 


79 


324 


4.10 


3 


32 


256 


1 


203 


1498 


7.38 


4 


128 


16 


1 


56 


368 


6.57 


5 


128 


64 


1 


99 


1562 


15.80 


6 


128 


256 


1 


272 


6871 


25.26 


7 


8 


256 


4 


222 


779 


3.51 


8 


32 


64 


4 


79 


362 


4.58 


9 


32 


256 


4 


206 


1682 


8.17 


10 


128 


16 


4 


56 


379 


6.77 


11 


128 


64 


4 


99 


1580 


15.96 


12 


128 


256 


4 


273 


7029 


25.75 


13 


2 


64 


16 


57 


289 


5.07 


14 


2 


256 


16 


183 


1348 


7.37 


15 


8 


16 


16 


28 


79 


2.82 


16 


8 


64 


16 


58 


330 


5.69 


17 


8 


256 


16 


195 


1573 


8.07 


18 


32 


16 


16 


29 


130 


4.48 


19 


32 


64 


16 


61 


530 


8.69 


20 


32 


256 


16 


217 


2806 


12.93 


21 


128 


16 


16 


38 


420 


11.05 


22 


128 


64 


16 


85 


2203 


25.92 


23 


128 


256 


16 


259 


7783 


30.05 


24 


2 


16 


64 


60 


228 


3.80 


25 


2 


64 


64 


111 


962 


8.67 


26 


2 


256 


64 


335 


4644 


13.86 


27 


8 


16 


64 


62 


239 


3.85 


28 


8 


64 


64 


108 


1012 


9.37 


29 


8 


256 


64 


304 


4934 


16.23 


30 


32 


16 


64 


61 


289 


4.74 


31 


32 


64 


64 


112 


1260 


11.25 


32 


32 


256 


64 


323 


6104 


18.90 


33 


128 


16 


64 


82 


598 


7.29 


34 


128 


64 


64 


135 


2521 


18.67 


35 


128 


256 


64 


373 


11752 


31.51 



Table 1. Performance comparison: topology number (1st column), number of 
input, hidden and output neurons (2nd, 3rd and 4th column), computing time 
in seconds on the SYNAPSE 2 (5th column) and on a PC with 300 MHz Pen- 
tium II processor (6th column) and speedup of the SYNAPSE 2 (last column). 
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Summary: When establishing a virtual private network (VPN), a telecommunication provider has 
to balance two conflicting goals. On the one hand, the customer requires reliability, on the other 
hand, the provider has to keep costs at a minimum. This article presents two related approaches for 
the optimal computational solution of the embedding of a VPN which is crucial in modern network 
planning. 

1. Introduction 

In telecommunications engineering, there is more to do than accelerating transmission or constructing 
safer encryption codes. Raising a large technological system typically shows an increasing risk of 
failure due to unforeseen influences, inherent inconsistencies, or lack of control despite all efforts. 
Keeping their networks stable and operable are the primary goals of all telecommunication providers. 
Therefore, safety measures are being called for by their customers. A typical example in this context 
is the routing of a virtual private network (VPN). 

A VPN, as the name indicates, is a network that appears to be exclusively controlled and operated by 
a customer company alone. In reality, it consists of a number of permanent lines that are leased from 
a carrier. So control also lies upon the latter to a significant extent. And for this reason, the carrier 
has to provide these lines subject to a reliability demand. This means that one failure in the carrier’s 
network may at most cause one failure on the VPN level, in the customer’s network. Technically 
speaking, the different leased lines must be routed disjointly through the carrier’s network. 

In order to be competitive, the carrier must also consider the cost side of this demand. So naturally, 
a reliable embedding of minimum cost is required. The problem of finding this optimum is the 
mathematical task we treat in this paper. 

The basis for this paper was a joint R&D project between the group Combinatorial Optimization and 
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Graph Algorithms at TU Berlin and the Network Optimization group at Deutsche Telekom AG that 
has taken place from July 1995 through June 1996. It was initiated by earlier investigations of the 
Deutsche Telekom AG [6]. One of the authors has written his Diploma thesis [4] about this project 
containing all presented aspects in detail, along with many examples and important extensions. A 
more elaborated journal paper [3] is in preparation, featuring more concepts and details of this 
research. 



Terminology 

Throughout this text, we will refer to the physical network as the base graph G — (V,E), and to 
the requested connections as the demand graph H = (T, D), T CV. The elements of V and E are 
called the nodes and edges of G, respectively. An element d = {s,t} £ D is called a demand. Both 
input graphs are multi-graphs which may contain parallel edges or demands, respectively, but no 
loops. For examples of such data, see Figures 1 and 2. 




Figure 1: A physical network Figure 2: A virtual private network 

(base graph) (demand graph) 



In our project, we investigated the mathematical and algorithmic aspects of the so-called Minimum 
Cost Disjoint Paths Problem in two versions, node- and edge-disjoint, respectively. The latter reads 
as follows: 
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Minimum Cost Disjoint Paths Problem (MCDPP) (edge-disjoint version) 



Given a base graph G = (V, E) with associated edge costs c: E and a demand 

graph H = (T, D) where T CV. For each demand d = {s, t}, find an embedding in 
G as a simple s-t-path E^ C E such that all paths are mutually edge-disjoint and the 
total cost of all edges used is a minimum: 

min 

d£D egB** 

s. t. E^nE^ = 0 for all rf,d' € £>,d ^ d'. 



The node-disjoint version differs from this one in that the paths may at most mutually share their 
terminals. For these problems, we ask for only one optimal solution and do not require to produce 
each of them, in case there are more than one. 

An abundance of theoretical results is available on the topic of disjoint routing. The most important 
ones are the NP-hardness of the problem, and the existences of feasible solutions as well as polynomial 
time algorithms in highly restricted settings. A large number of them are surveyed in [4]. 

2. Lower Bounds on Costs 

The key ingredients we need for designing a suitable search method are strong lower and upper 
bounds on the optimal objective function values. In our case we generate feasible solutions and 
hence upper bounds by means of a suitable heuristics guided by structural information that comes 
along with our lower bounds. 

In the following discussion, we concentrate on the computation of lower bounds. We denote a lower 
cost bound on the optimal objective value of an (MCDPP) instance (G, H) with cost function c by 
LBe(G,7f). 

2.1 The Model 

The min-cost VPN embedding problem may be considered as a special case of the general multi- 
commodity flow problem. The latter consists of a certain number of individual minimum cost flow 
problems, tied together by so-called bundle constraints (for an introduction see [1]). These con- 
straints typically limit the amount of fiow on an arc of the underlying network up to a joint capacity 
value. In our case, this value is uniformly equal to one on all the arcs, in addition to all arc flows 
being restricted to the binary values of {0, 1}. This forces the desired disjointness. 

We now model the min-cost VPN embedding problem as an integer linear program. To this end, 
we formalize three types of constraints, which are reflected in the corresponding (in) equalities in the 
model: 

(i) all demands are flows between their associated terminals, 

(ii) all edges have got bundle capacities of one, 

(iii) all flow values are binary. 
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In order to get a network N = (V, A) of directed arcs, we split all edges into two anti-parallel arcs 
with costs equal to c(e): 



A := { (u, u) I {u, u} G jE? } U { (u, u) \ {u,v} £ E} . 

Each demand d £ D represents a commodity, and the vector of all variables describing its flow values 
is denoted by x^. For each d = {s, t}, the balance vectors £ {—1,0, are given by: 



1 if u = s, 

6‘*(d) := ^ -1 if t) = t, 

0 otherwise. 

Finally, M £ (—1,0, denotes the node-arc incidence matrix where 

1 if a = (u, v) for some u, 

— 1 if a = {v^u) for some u, 

0 otherwise. 

The edge-disjoint model of (MCDPP) now reads as follows: 



Model (ILP) 



min EE c(a)x^(a) 

deD aeA 

s. t. (i) Mx^ = b^ for all d G D 

(ii) ^^{x^(u,v) -\-x^{VyU)) < 1 for all {u,v} £ E 

deD 

(iii) x^ £ (0, 1}'^' for all d£D. 



Here, (i) models flow conservation (Kirchhoff’s Law), and (ii) the joint capacity constraints. 

The node-disjoint model differs from the above only in constraint (ii) which is transformed to nodes 
instead of edges in order to control the flow through nodes. 



2.2 Lagrangian Relaxation 

We assume that the reader is familiar with the basics of Lagrangian relaxation, subgradient optimiza- 
tion, and their properties, see [1, 2]. Lagrangian relaxation applied to the disjointness constraint (ii) 
of Model (ILP) reads: 



Model (LRa) 



min E E (c(u, v) 4- A(u, v)){x^{u, v) 4- x"^{v, u)) - ^ A(u, v) 

deD {u,v}eE {u,v}eE 

s. t. (i) Mx^ = 6“' for all d e D 

(iii) x‘‘ e {0, for all deD. 



Now for any value of vector A > 0, the remaining problem is to And a minimum cost flow for all 
demands d £ D independently horn each other with respect to the new cost function c4-A. Apparently, 
this is equivalent to \D\ individual shortest path problems. 
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3. The Branch-and-Bound Method 

Our first solution method is based on the branch-and-bound paradigm. For an introduction to this 
principle, see for instance, [5]. 

We note that a branch-and-bound approach is well suited for the solution of (MCDPP): we have 
fast methods for computing good lower bounds, and we are able to partition each solution set into 
disjoint subsets representing again (MCDPP) type problems. 

3.1 Branching Strategy 

We can achieve a disjoint partitioning of the partial solution set in a search tree vertex by observing 
that every demand d = {s, t} must be embedded using exactly one edge e incident to its terminal 
s. Hence, we choose every neighboring edge e once as a candidate for the embedding where we have 
chosen the terminal s at random from the two terminals of d. Then, we partition the set of all 
solutions into subsets differing by these edges e which are respectively added to the embedding of d 
(and thereby excluded from the embeddings of all other demands). 




Figure 3; Reduced problem ( G' — e, H' - d-{- {u, t} ) after embedding edge e 
for demand d. 

The node-disjoint case differs only slightly in that we reduce the base graph to G' - v instead of 
G'-e. 

3.2 Bounding Subproblems 

We test whether a subproblem (G', H') at hand can possibly deliver a global optimal solution by 
comparing 

(*) cost{E \ E') + LBc(G\ H’) < cost(Sbest) 

where E, E' are the edge sets of G, G', respectively. The term E\E' means all edges hitherto 
embedded for some demand, and cost(Sbest) is the cost of the actual found best solution Sbest- 
Obviously, this is not the case otherwise. 

4. The Backtracking Method 

Our second method is related to branch-and-bound. It partitions the set of all solutions into smaller 
subsets, yet the divisions are made differently. The general idea is to simultaneously reduce G and 
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H in every step, thereby recursively defining a search tree. For this purpose, we successively embed 
entire paths which are delivered by an appropriate path generation sub-algorithm, one at a time. 
On every level of the search tree, we attempt to solve an (MCDPP) type problem by 

embedding one demand from if' successively by all paths that are possible in G'. With each of these 
paths, we define a subproblem by removing its nodes or edges from G' (depending on the disjoint ness 
mode) and the embedded demand from H'. The bounding works analogously to equation (*). 

5. Conclusion 

Confronted with a real-world problem of the Deutsche Telekom AG, we have investigated the op- 
timization problem (MCDPP) on arbitrary base and demand graphs in both versions, edge- and 
node-disjoint. Due to their NP-hardness, we pursued exact approaches lacking a polynomial time 
guarantee. 

We have set up and implemented a method for computing lower bounds on the optimal objective 
value of (MCDPP), representing the total embedding cost. They are the core ingredients in both 
practical algorithms we have devised for obtaining optimal solutions to the embedding problems. 
Subsequently, we have developed two combinatorial codes, one using a branch-and-bound method 
and a backtracking type one. While the former approach had to be carefully adapted in order to 
actually produce disjoint paths, the backtracking method is based on entire paths generated by an 
appropriate subroutine. 

Both methods have the advantage that execution can be terminated at any time while retaining the 
best feasible solution hitherto found. This may be automated by finishing when the total cost cost(S) 
of a solution S drops below a certain percentage p above the global lower cost bound LBc{G, H). 
We performed extensive experiments on several example classes which are from or close to real-world 
situations. Throughout all our test runs, we have used the lower cost bounds LBc that were always 
taken from the Lagrangian relaxation approach using subgradient optimization. 

We have benchmarked both of our algorithms against the standard LP and IP solver cpiex, version 
3.0. This is a general tool for arbitrary optimization problems encoded by linear models. It does not 
explicitly take into account the graph-theoretical structure of our problem (MCDPP), but exclusively 
operates with the model variables. We observed in general that our algorithms mostly excelled in 
performance over cpiex. Some significant results can be seen in Table 1 below. Note that, although 
in the cases considered feasible solutions always exist, this need not necessarily be true in general. 
Methods for checking the existences of feasible solutions are described in [3, 4]. 

The code developed and implemented in this project has been integrated into a toolbox which is 
used by the Deutsche Telekom AG. 
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time [min] 


size of 
base graph 


back- 

tracking 


branch- 

and-bound 


cpiex 


exampleGe 


185 


0.0 


0.0 


0.3 


exampleGn 


nodes 


0.1 


0.3 


0.2 


example 19e 


354 


1.5 


2.0 


8.1 


example 19n 


edges 


2.4 


0.9 


16.9 


example42e 




0.2 


0.1 


60.1 


example42n 




0.7 


0.3 


20.9 


examplell7e 


676 n. 
1107 e. 


2.6 


2.7 


> 500 


gerl7e 


164 


1.4 


7.1 


6.5 


gerl7n 


nodes 


0.0 


0.6 


5.4 


ger73e 


386 


0.1 


0.2 


186.6 


ger73n 


edges 


0.0 


0.0 


8.4 


gerl6e 


434 


2.3 


8.2 


29.5 


gerl6n 


nodes 


19.3 


33.8 


22.1 


ger28e 


978 


3.6 


13.5 


93.9 


ger28n 


edges 


> 200 


94.8 


151.8 


usa7e 


176 


0.0 


0.2 


0.2 


usa7n 


nodes 


0.5 


0.6 


2.8 


usal2e 


314 


0.1 


0.1 


2.7 


usal2n 


edges 


0.4 


0.1 


2.0 


usa26e 


617 


0.2 


0.1 


83.8 


usa26n 


nodes 


8.4 


2.3 


100.0 


usa37e 


1039 


2.8 


1.8 


304.2 


usa37n 


edges 


0.1 


0.0 


182.7 



Table 1: Running times of our (MCDPP) codes compared to cpiex 3.0. The 
postfix ‘e’ refers to the edge-disjoint case, ‘n’ to the node-disjoint one while the 
decimals form the numbers of demands. Entries of “0.0” mean “below 6 seconds.” 
All times were measured on a Sun SPARC 20 machine under SunOS 4.1.4. 




580 



time [sec] 


cheapest paths 


Lagrangian 


LP-relaxed 


optimum 




bound 


time 


bound 


time 


bound 


time 


value 


time 


exampleGe 


3567 


0.0 


3643.0 


0.2 


3643.0 


12.7 


3643 


17.8 


example6n 


3567 


0.0 


3673.5 


0.5 


3673.5 


10.3 


3677 


11.4 


example 19e 


6918 


0.0 


7688.0 


1.4 


7688.0 


483 


7688 


483 


example 19n 


6966 


0.0 


8303.1 


1.2 


8303.5 


662 


8462 


1011 


example42e 


8263 


0.0 


8707.0 


1.5 


8707.0 


3605 


8707 


3605 


example42n 


8263 


0.0 


9102.6 


1.7 


9102.7 


1237 


9105 


1251 


example 117e 


10478 


0.03 


11302.9 


4.2 


11303.0 


> 10^ 


11303 


> 10^ 


gerl7e 


54215 


0.0 


56336.5 


1.3 


56340.5 


208 


56467 


390 


gerl7n 


54679 


0.0 


58949.0 


0.3 


58949.0 


321 


58949 


321 


ger73e 


88969 


0.0 


92132.0 


1.7 


92132.0 


11198 


92132 


11198 


ger73n 


89601 


0.0 


98146.0 


1.1 


98146.0 


502 


98146 


502 


gerl6e 


53321 


0.0 


55265.8 


2.9 


55266.0 


1771 


55266 


1771 


gerl6n 


53423 


0.0 


57024.2 


2.4 


57086.0 


1323 


57086 


1323 


ger28e 


80356 


0.0 


83094.8 


4.6 


83095.0 


5634 


83095 


5634 


ger28n 


80125 


0.0 


87348.0 


5.3 


87404.4 


8903 


87498 


9108 


usa7e 


60810 


0.0 


61498.8 


0.6 


61499.0 


11 


61499 


11 


usa7n 


61214 


0.0 


66354.0 


0.6 


66398.0 


8 


69942 


168 


usal2e 


67835 


0.0 


69898.9 


0.5 


69899.0 


68 


69899 


162 


usal2n 


67835 


0.0 


73092.4 


0.7 


73104.7 


43 


74556 


121 


usa26e 


104293 


0.0 


109493.7 


3.9 


109494 


5029 


109494 


5029 


usa26n 


105108 


0.0 


123412.4 


3.5 


123601.1 


2659 


123835 


5997 


usa37e 


105412 


0.0 


112486.0 


2.1 


112486.0 


18036 


112486 


18252 


usa37n 


105412 


0.0 


121550.0 


2.0 


121550.0 


10961 


121550 


10961 



Table 2: Comparison of different lower cost bounds: values and computation 
times are given. The LP-relaxed and optimum bounds were computed with 
cpiex, the cheapest paths with the same Dijkstra-subroutine underlying the La- 
grangian relaxation. It can be seen that the Lagrangian bounds represent a good 
compromise between accuracy and time. 
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Figure 4: Optimal solutions to (MCDPP) on the example given in Figures 1 and 2. Left: 
node-disjoint case, right: edge-disjoint case. 
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